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Preface 


This manual is Volume 2 of the National Language Information and Design 
Guide (NLIDG) series. It provides general information about languages and 
countries 1n the left-to-right and DBCS groups. 


Chapter 1, “Introduction” describes Volume 2. 


Chapter 2, “National Language Descriptions” describes countries and their 
languages. 


Chapter 3, ‘Technical Characteristics” describes the technical characteristics of 
information used in the countries. 


There is an appendix of some of the standards and laws that apply to National 
Language Support (NLS). 


There is a glossary of terms used in Volume 2. 

The other volume of the National Language Information and Design Guide is: 

e Volume |, Designing Enabled Products, Rules and Guidelines, SE09-8001-00, 
describes rules and guidelines for designing products that are enabled for 


national languages. Enabling is an intermediate step between no national 
language support and national language implementation. 
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Chapter 1. Introduction 


This manual gathers in one place the technical information for languages and 
countries. The information in this manual was obtained from the countries 
themselves. It was evaluated and assembled into this manual. At the back 1s a 
reader’s comment form that you are encouraged to use to communicate any 
suggestions and comments. 


Terminology Used in Volume 2 


Whenever possible, this manual uses the names of the countries that are most 
familiar to North American English speakers. However, this is not possible in all 
cases because the countries themselves like to be referred to by their proper 
names, as follows: 


e The German Democratic Republic is referred to as GDR. 

e The Federal Republic of Germany is referred to as FRG. 

e The People’s Republic of China is referred to as PRC. 

e The Republic of China is referred to as ROC. (Taiwan is the old name.) 


Familiarity is also the reason for two abbreviations used in this manual. The 
name United States of America is abbreviated to USA and the name United 
Kingdom is abbreviated to UK. 


Some languages are used in countries other than the country in which the 
language originated. For example, French is used in Belgium, Canada, and 
Switzerland, as well as France. Sometimes there are differences between the 
forms of the language used in the various countries. To distinguish between the 
different forms of a language, this manual assigns the generic name of the 
language to the form used in the country of origin. That is, French is used in 
France, English in the UK, and German is used in GDR, FRG and Austria. 
Canadian French is used in Canada, USA English is used in the USA, and Swiss 
German is used in Switzerland. 
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Language Groupings 
There are three major language groups: 
1. Left-to-right languages using single-byte character sets (SBCS) 
2. Languages using double-byte character sets (DBCS) 
3. Bidirectional languages using SBCS. 


The tables in Chapter 3, “Technical Characteristics” of this volume, are 
structured according to the first two language groups. 
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Chapter 2. National Language Descriptions 


Albania 


This chapter provides information about: 


Countries 

The language (or languages) used in each country 

The percentage of the population that uses each language in the country 
The alphabet (or alphabets) used in each country 


The support that should be provided in countries that have two or more 
languages. 


Albania (Shqipéri) is a southeast European country bordering on the Adriatic 
Sea, Yugoslavia, and Greece.! The capital is Tirané (known as Tirana to English 
speakers). The population of Albania is 3 million. 


Name of the Language 


The language is Albanian. 


Description of the Language 


Albanian is a unique language, different from the languages of other European 
nations. Albanian is written left-to-right, top-to-bottom. 


Percentage of the Population That Uses the Language 


Albanian 100%. 


Alphabet Characteristics 


Albanian uses the Latin alphabet. 


1 


Bordering countries are listed in a north, east, south, and west sequence. 
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Argentina 


Austria 


Argentina is a South American country bordering on Bolivia, Paraguay, Brazil, 
Uruguay, the Atlantic Ocean, and Chile. The capital is Buenos Aires. The 
population of Argentina is 28 million. 

Name of the Language 

The language is Spanish. 

Description of the Language 

Spanish is a Romance language, written left-to-right, top-to-bottom. 
Unique Characteristics of the Language as Used in Argentina 

Argentine Spanish differs from Spanish only in minor ways. 

Percentage of the Population That Uses the Language 

e Spanish 100%. 

Alphabet Characteristics 


Spanish uses the Latin alphabet. 


Austria (Osterreich) is a Central European country bordering on Czechoslovakia, 
Hungary, Yugoslavia, Italy, Switzerland, and the Federal Republic of Germany. 
The capital is Wien (known as Vienna to English speakers). The population of 
Austria is 7.5 million. 

Name of the Language 

The language is German. 

Description of the Language 

German is written left-to-right, top-to-bottom. 

Percentage of the Population That Uses the Language 

e¢ German 100%. 


Alphabet Characteristics 


German uses the Latin alphabet. 


2-2 National Language Information and Design Guide Volume 2 


i Belgium 


Belgium (Belgié or Belgique) is a European country bordering on the 
Netherlands, the Federal Republic of Germany, Luxembourg, and France. The 
capital is Bruxelles (known as Brussels to English speakers). The population of 
Belgium is 10 million. 


Names of the Languages 
The languages are: 

Belgian Dutch (Flemish) 
Belgian French 


German 
English. 


Descriptions of the Languages . 


Belgium has three official languages, Belgian Dutch (also known as Flemish), 
Belgian French, and German, although English is widely used. 


All languages are written left-to-right, top-to-bottom. 


Unique Characteristics of the Languages as Used in Belgium 


Belgian Dutch is similar to the Dutch language spoken in the Netherlands, but 
the vocabulary, code page, and keyboard are different. Dutch documents must 
be translated into Belgian Dutch. 


Also, Belgian French is similar to French, but the vocabulary, code page, and 
keyboard are different. French documents must be translated into Belgian 
French. In Belgian French, English-language abbreviations such as RPG need 
not be translated into French equivalents such as GAP. 


Belgian German is similar to German, but the code page and keyboard are 
different. 


Percentage of the Population That Uses the Languages 


The following numbers indicate the use of the languages in business: 


e Belgian Dutch 55% 
e Belgian French 40% 
e¢ German 5% 
e English (See below.) 


40% of all companies use English in addition to the three official languages. 


Chapter 2. National Language Descriptions 2-3 


Bulgaria 


Canada 


Alphabet Characteristics 

All languages use the Latin alphabet. 
Support of More than One Language 
English nomenclature is required. 


Keyboards must use nonescaping (dead) keys to generate accents. 


Bulgaria (Balgarija) is a southeast European country bordering on Romania, the 
Black Sea, Turkey, Greece, and Yugoslavia. The capital is Sofija (known as Sofia 
to English speakers). The population of Bulgaria is 9 million. 

Names of the Languages 

The language is Bulgarian, but some Turkish is also used unofficially. 


Description of the Language 


Bulgarian is a Slavic language and uses Cyrillic characters, written left-to-right, 
top-to-bottom. 


Percentage of the Population That Uses the Languages 


e Bulgarian 90% 
e Turkish 10%. 


Alphabet Characteristics 


Bulgarian uses the Cyrillic alphabet. 


Canada is the second-largest country in the world, bordering on the Arctic Ocean, 
Atlantic Ocean, the USA, and the Pacific Ocean. Canada contains ten provinces 

and two territories. The capital is Ottawa, located in the province of Ontario. The 
population of Canada is 25 million. 


Names of Languages 
The languages are: 


e Canadian English 
e Canadian French 


2-4 National Language Information and Design Guide Volume 2 


Descriptions of the Languages 

Canada has two official languages, English and French. Canadian French is the 
primary language of the province of Quebec. Both Canadian French and 
Canadian English are used in the province of New Brunswick. Canadian English 
is the primary language of the other provinces and territories. 


Both languages are written left-to-right, top-to-bottom. 


Unique Characteristics of the Languages as Used in Canada 


Canada generally uses the United Kingdom spellings of English words, although 
United States spellings are accepted in publications. 


Canadian French has evolved slightly differently from the French in France. One 
difference shows in the accenting of uppercase vowels in Canadian French. Until 
recently, this accenting was not done in France, but it is emerging as a 
requirement. 
An accent can change the meaning of a word. For example: 

«Porc salé» means “Salted pork.” 


«Porc sale» means “Dirty pig.” 


For quotation marks, Canadian English uses a double, comma-like symbol 
(“ and ”). Canadian French users prefer angled quotation marks (« and »). 


Percentage of the Population That Uses the Languages 


e = English 10% 
e French 20% 
e Both 10%. 


Alphabet Characteristics 
Both Canadian English and Canadian French use the Latin alphabet. 
Support of More than One Language 


Users should be able to choose one of the two official languages at logon or in 
the user profile. Users should also be able to choose to have the Canadian 
French keyboard emulated on a keyboard that does not have Canadian French 
characters. 


Canadian French keyboards and graphic character sets are different from those 
for French and from those for Canadian English. For example, France uses 
AZERTY keyboards. Canada uses QWERTY keyboards. Canadian English 
keyboards do not have accents; Canadian French keyboards do. 
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Czechoslovakia 


Denmark 


Czechoslovakia (Ceskoslovensko) is a central European country bordering on 
Poland, the USSR, Hungary, Austria, the Federal Republic of Germany, and the 
German Democratic Republic. The capital is Praha (known as Prague to English 
speakers). The population of Czechoslovakia is 15 million. 

Names of Languages 


The languages are: 


e Czech 

e Slovak 

e Hungarian (unofficially). 
Descriptions of the Languages 


Both Czech and Slovak are Slavic languages and are written left-to-right, 
top-to-bottom. 


Percentage of the Population That Uses the Languages 


@e Czech 60% 
e Slovak 33% 
e Hungarian T%. 


Alphabet Characteristics 


Czech and Slovak both use the Latin alphabet. 


Denmark (Danmark) is a Scandinavian country bordering on the North Sea, the 
Baltic Sea, and the Federal Republic of Germany. The North Sea separates 
Denmark from Norway. The Baltic Sea separates Denmark from Sweden on the 
northeast side and from the German Democratic Republic on the southeast side. 
Its capital is Kobenhavn (known to English speakers as Copenhagen). The 
population of Denmark is 5 million. 


Name of the Language 
The language is Danish. 
Description of the Language 


Danish is a Germanic language and has some similarities with German, 
Norwegian, and Swedish. Danish is written left-to-right, top-to-bottom. 
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Finland 


Percentage of the Population That Uses the Language 


e Danish 100%. 


Alphabet Characteristics 


Danish uses the Latin alphabet. 


Finland (Suomi) is a Scandinavian country, bordering on Norway, the USSR, the 
Baltic Sea, and Sweden. The capital is Helsinki. The population of Finland 1s 5 
million. 

Names of Languages 

The main language is Finnish, but Swedish is also spoken. 

Descriptions of the Languages 

Finnish is a unique language whose vocabulary and grammar have few similarities 
to neighboring major language groups, such as the Germanic languages of 
Norway, Sweden, and Denmark and the Slavic languages of the European part of 
the Soviet Union. The Finnish language is more closely related to the Turkish 
and Hungarian languages than to the Germanic and Slavic languages. 

The Swedish used in Finland is identical to that used in Sweden. 

Finnish and Swedish are written left-to-right, top-to-bottom. 


Percentage of the Population That Uses the Languages 


e Finnish 94% 
e Swedish 6%. 


Alphabet Characteristics 
Both Finnish and Swedish use the Latin alphabet. The letters 4, A, 4, A, 6, and 


O are an integral part of the alphabet and are not considered to be accented 
letters. In the alphabet sequence, these letters follow the letter z. 


Chapter 2. National Language Descriptions 2-7 


France 


France is a European country bordering on Belgium, Luxembourg, the Federal 
Republic of Germany, Switzerland, Italy, the Mediterranean Sea, Spain, and the 
Atlantic Ocean. The capital is Paris. The population of France is 54 million. 
Name of Language 
The language is French. 
Description of the Language 
French is a Romance language and is written left-to-right, top-to-bottom. 
Unique Characteristics of the Language as Used in France 
IBM France is changing its character set to include accented uppercase vowels. 
An accent can change the meaning of a word. For example: 

‘Le President assassiné” means “the assassinated president.” 

‘Le President assassine” means “the president assassinates.” 
Percentage of the Population That Uses the Language 
e French 100%. 


Alphabet Characteristics 


French uses the Latin alphabet. 


German Democratic Republic (GDR) 


GDR (DDR - Deutsche Demokratische Republik) is a central European country 
bordering on the Baltic Sea, Poland, Czechoslovakia, and the Federal Republic of 
Germany. The capital is Ost Berlin (known to English speakers as East Berlin). 
The population of GDR is 17 million. 

Name of the Language 

The language is German. 

Description of the Language 


German is written left-to-right, top-to-bottom. 
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Percentage of the Population That Uses the Language 
e German 99% 

e Others 1% (Serbian). 

Alphabet Characteristics 


German uses the Latin alphabet. 


Germany, Federal Republic of (FRG) 


Greece 


FRG (Bundesrepublik Deutschland) is a central European country bordering on 
the North Sea, Denmark, the Baltic Sea, the German Democratic Republic, 
Czechoslovakia, Austria, Switzerland, France, Luxembourg, Belgium, and the 
Netherlands. The capital is Bonn. The population of FRG is 61 million. 
Name of the Language 

The language is German. 

Description of the Language 

German is written left-to-right, top-to-bottom. 

Percentage of the Population That Uses the Language 

¢ German 100%. 


Alphabet Characteristics 


German uses the Latin alphabet. 


Greece (Ellas) is an eastern European country bordering on Albania, Yugoslavia, 
Bulgaria, Turkey, the Aegean Sea, and the Mediterranean Sea. The capital is 
Athinai (known as Athens to English speakers). The population of Greece is 10 
million. 


Name of the Language 
The language is Greek. The written form is New Hellenic, which is a 


combination of the “formal” and “popular” forms of Greek used until 1982. 
Formal Greek used many accents and was closest to the ancient Greek language. 
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Hungary 


Iceland 


Description of the Language 

Greek is a native language for Greece and Cyprus. 
New Hellenic is wnitten left-to-right, top-to-bottom. 
Percentage of the Population That Uses the Language 
e Greek 100%. 

Alphabet Characteristics 


Greek uses the Greek alphabet. 


Hungary (Magyarorszag) is a central European country bordering on 
Czechoslovakia, USSR, Romania, Yugoslavia, and Austria. The capital is 
Budapest (actually two cities, Buda and Pest). The population of Hungary is 10.5 
million. 

Name of the Language 

The language is Hungarian, which is a unique language. Hungarian is more 
closely related to Turkish and Finnish than to the Slavic languages of Hungary’s 
neighbors. 

Description of the Language 

Hungarian is written left-to-right, top-to-bottom. 

Percentage of the Population That Uses the Language 

e Hungarian 100%. 

Alphabet Characteristics 


Hungarian uses the Latin alphabet. 


Iceland is an island in the North Atlantic Ocean, located between Greenland, 
Scotland and Norway, situated entirely South of the Arctic Circle. The capital of 
Iceland is Reykjavik, where about half the population lives. The total population 
is a quarter of a million. 


2-10 National Language Information and Design Guide Volume 2 


Name of Language 

The language is Icelandic. 

Description of the Language 

Icelandic is a Germanic language that has, for the most part, retained its written 
form for the past eight centuries. It is related to Norwegian, Danish and Swedish. 
The language is written left-to-right and top-to-bottom. 

Unique Characteristics of the Icelandic Language 

Icelandic does not adopt foreign words easily, so new Icelandic words are often 
made up through compounding and derivations. Compound words are 
frequently used in Icelandic. 

All the vowels (A, E, I, U, O, and Y) can have the acute accent. Consequently, 
dead-key support is required on keyboards. 

Percentage of the Population That Uses the Language 

e Icelandic 100%. 


Alphabet Characteristics 


Icelandic uses the Latin alphabet. 


Italy 


Italy (Italia) is a European country bordering on Switzerland, Austria, 
Yugoslavia, the Adriatic Sea, the Mediterranean Sea, and France. The capital is 
Roma (known as Rome to English speakers). The population of Italy is 57 
million. 

Name of Language 

The language is Italian. 

Description of the Language 


Italian is a Romance langua,e and is written left-to-right, top-to-bottom. 
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Japan 


Percentage of the Population That Uses the Language J 


e Italian 100% 
e German 0.6% 
e French 0.2%. 


Alphabet Characteristics 


Italian uses the Latin alphabet. 


Japan (Nihon) consists of many islands located off the east coast of the Asian 
mainland. The largest of the islands are Hokkaido, Honshu, Shikoku, and 
Kyushu. The capital is Tokyo, located on the island of Honshu. The population 
of Japan is 119 million. 


Name of the Language 

The language is Japanese, which contains three forms of characters: 

e Kanji 

e Katakana J | 
e Hiragana. 

Description of the Language 

Kanji (literally, Chinese characters) is a set of ideographic characters originally 
derived from Chinese characters. Hiragana is set of phonetic characters usually 
used to provide word endings required by Japanese but not provided by Kanji. 
Katakana is another set of phonetic characters that are primarily used to 
represent words of non-Japanese origin. The sound(s) of a Kanji character can 
be represented by Hiragana or Katakana. Note that any piece of written text 
may contain examples of all three forms of Japanese characters. 

Japanese is written in two ways: the traditional way (top-to-bottom, right-to-left) 
and the modern way (left-to-right, top-to-bottom). The Japanese language does 
not use a blank between words. 


Percentage of the Population That Uses the Language 


e Japanese 100%. 
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nd 


The Netherlands 


Alphabet Characteristics 


The following groups of characters are included in the Japanese Graphic 
Character Set: 


Kanji characters 

Katakana characters 

Hiragana characters 

Arabic numerals (0 through 9) 

Latin alphabetic characters (a through z and A through Z) 
Greek characters 

Russian characters. 


The Japanese language itself does not have Latin, Greek, or Russian characters. 
These characters are included in the character set for information interchange 
purposes. 


The Asian languages do not have alphabets. In Japanese, Hiragana and 
Katakana look like alphabets, but they are not really alphabets. They are 
complementary characters, and Kanji is the major character. This is different 
from languages using the Latin alphabet, which only need alphabetic characters 
to compose words. 


The Netherlands (Nederland) is a European country bordering on the North Sea, 
the Federal Republic of Germany, and Belgium. The Netherlands has two 
capitals, the business capital, Amsterdam, and the political capital, Den Haag or 
’s-Gravenhage (known as Hague to English speakers). The population of the 
Netherlands is 14 million. 

Names of the Languages 


The language is Dutch. English, French, German, and Spanish are also used in 
business. 


Description of the Languages 


Dutch is a Germanic language that is written left-to-right, top-to-bottom. 
Unique Characteristics of the Languages as Used in the Netherlands 


Dutch differs slightly from Belgian Dutch (Flemish) in syntax and vocabulary. 
Belgian Dutch documents must be translated into Dutch. 
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New Zealand 


Percentage of the Population That Uses the Languages 


The following numbers indicate the use of the languages in business: 


e Dutch 100% 
e English 80% 
e German 50% 
e French 30% 
e Spanish 5%. 


This list indicates that most business is conducted in two or more languages. 
Alphabet Characteristics 


All languages use the Latin alphabet. 


New Zealand is located in the South Pacific and consists of two main islands. 
The nearest country is Australia to the West. The capital of New Zealand is 
Wellington. The population of New Zealand is 3.3 million. 

Name of Language 


The language is English. 


Description of the Language 


English is a Germanic language that uses the Latin alphabet. 


Unique Characteristics of the Language as Used in New Zealand 

For the most part, the New Zealand spelling of English is identical to the spelling 
in the UK. 

Percentage of the Population That Uses the Language 

e English 100%. 

Alphabet Characteristics 


English uses the Latin alphabet. 
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Norway 


Norway (Norge) is a Scandinavian country bordering on the Norwegian Sea, the 
Barents Sea, the USSR, Finland, and Sweden. Its capital is Oslo. The 
population of Norway is 4 million. 

Name of the Language 

The language is Norwegian. 


Description of the Language 


Norwegian is a Germanic language and has some similarities with German, 
Danish, and Swedish. Norwegian is written left-to-nght, top-to-bottom. 


Percentage of the Population That Uses the Language 
e Norwegian 100%. 
Alphabet Characteristics 


Norwegian uses the Latin alphabet. 


People’s Republic of China 


The People’s Republic of China (PRC) 1s located in eastern Asia and borders on 
the USSR, Mongolia, Korea, the Yellow Sea, East China Sea, South China Sea, 
Vietnam, Laos, Burma, India, Bhutan, Sikkim, Nepal, Jammu, and Kashmir. 
The capital is Beijing (formerly known as Peking). The population of PRC 1s 
over 1.1 billion. 


Name of the Language 
The language is Chinese. 
Description of the Language 


Written Chinese is made of ideographic characters, known as Hanzi, which were 
originally invented in China. Some Hanzi are also used in PRC’s neighboring 
countries, such as Japan (known as Kanji) and Korea (known as Hanja). While 
most Hanzi still remain in their traditional form, in PRC however, many Hanzi 
have been simplified by the government to make them easier to use. Hence the 
written Chinese used in the PRC is generally referred to as Simplified Chinese. 


Chinese is written in two orientations: the traditional way (top-to-bottom, 
right-to-left) and the modern way (left-to-right, top-to-bottom). The Chinese 
language does not use a blank between words, except that blanks are sometimes 
used in front of a person’s name to show respect. 
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Poland 


Portugal 


PRC uses the Arabic numeric system and the Chinese numeric system. The 
Arabic numeric system is used for data processing purposes. 


Percentage of the Population That Uses the Language 
e Chinese 100%. 
Alphabet Characteristics 


The Chinese language does not have an alphabet. 


Poland (Polska) is a central European country bordering on the Baltic Sea, the 
USSR, Czechoslovakia, and the German Democratic Republic. The capital is 
Warszawa (known as Warsaw to English speakers). The population of Poland ts 
35 million. 

Name of the Language 

The language is Polish. 

Description of the Language 

Polish is a Slavic language and is written left-to-right, top-to-bottom. 

Percentage of the Population That Uses the Language 

¢ Polish 100%. 


Alphabet Characteristics 


Polish uses the Latin alphabet. 


Portugal is a European country bordering on Spain on the north and east, and 
the Atlantic Ocean to the south and west. The capital is Lisboa (known as 
Lisbon to English speakers). The population of Portugal is 10 million. 

Name of the Language 


The language is Portuguese. 
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Description of the Language 
Portuguese is a Romance language and is also spoken in: 


Angola 

Brazil 

Cape Verde 

Guinea Bissau 
Mozambique 

Sao Tome and Principe. 


Unique Characteristics of the Language as Used in Portugal 


There are only minor differences between Portuguese used in Portugal and 
elsewhere. 


As in French and Spanish, accented letters change the meaning of Portuguese 
words. 


Percentage of the Population That Uses the Language 
e Portuguese 100%. 
Alphabet Characteristics 


Portuguese uses the Latin alphabet. 


Republic of China 


The Republic of China (ROC) is located on the island of Taiwan off the 
southeast coast of matnland China. The Portuguese name for the island is 
Formosa. The capital is Taipei. The population of ROC is 20 million. 


Name of the Language 

The language is Chinese. 

Description of the Language 

Written Chinese is made of ideographic characters, known as Hanzi, which were 
originally invented in China. While the PRC has simplified many Hanzi, the 
Hanzi used in the ROC still remain in their traditional form and hence are 
generally referred to as Traditional Chinese. 

Chinese is written in two orientations: the traditional way (top-to-bottom, 
right-to-left) and the modern way (left-to-right, top-to-bottom). The Chinese 


language does not use a blank between words, except that blanks are sometimes 
used in front of a person’s name to show respect. 
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ROC uses the Arabic numeric system and the Chinese numeric system. The J 
Arabic numeric system is used for data processing purposes. 


Percentage of the Population That Uses the Language 
e Chinese 100%. 
Alphabet Characteristics 


The Chinese language does not have an alphabet. 


Republic of Korea (South Korea) 


South Korea (Tachan-Min’guk) is located in the south part of the Korean 
peninsula on the east coast of the Asian mainland. The capital is Seoul. The 
population of South Korea is 41 million. 


Name of the Language 
The language is Korean, which contains two forms of characters: 


e Hangeul 


e = Hanja. } 


Description of the Language 


Hangeul is a set of phonetic characters formed from components, called Jamo, 
which represent consonants and vowels. One phonetic syllable, a Hangeul 
character, can consist of an initial consonant and vowel combination, or of an 
initial consonant, vowel, and final consonant combination. There are 52 Jamo 
components. Among them, 10 are basic vowels and 14 are basic consonants. 
Some consonants are composed of two basic consonants, and some vowels are 
composed of up to three basic vowels. There are 21 vowels and 31 consonants, 
19 of which can be initial consonants and 28 can be final consonants. In total 
there are 11,571 (19 x 21 x (28 + 1)) Hangeul characters; each consists of two to 
seven basic Jamo components. Far fewer characters are in actual use. 


The principal form of Korean characters is Hangeul. Hanja is mainly used to 
represent people’s names and terminologies in science and education. The use of 
Hanja is declining. 

Korean is written left-to-right, top-to-bottom, but the traditional way 


(top-to-bottom, right-to-left) is still used in newspapers. 


Percentage of the Population That Uses the Language 


® Korean 100%. | 5 
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Alphabet Characteristics 

The Korean language does not have an alphabet. In Korean, Jamo looks like an 
alphabet but is not really an alphabet. There are no Jamo characters; rather there 
are Jamo components, which are used to compose a Hangeul character. This 


structure is different from languages using the Latin alphabet, which only need 
alphabetic characters to compose words. 


Romania (Rumania) 


Romania (Romania) is a southeast European country bordering on the USSR, 
the Black Sea, Bulgaria, Yugoslavia, and Hungary. The capital is Bucuresti 
(known to English speakers as Bucharest). The population of Romania is 22 
million. 

Names of Languages 

The language is Romanian but, Hungarian and German are also spoken. 


Description of the Language 


Romanian is a Romance language that is different from, but heavily influenced 
by, the Slavic languages of its neighbors. 


Romanian is wnitten left-to-nght, top-to-bottom. 


Percentage of the Population That Uses the Languages 


e Romanian 90% 
¢ Hungarian 8% 
e German 1% 
e Others 1%. 


Alphabet Characteristics 


Romanian uses the Latin alphabet. 


South Africa 


South Africa (Suid Africa) is the southernmost country in Africa. The capitals are 
Pretoria (administrative) and Cape Town (legislative). The population of South 
Africa is 30 million. 
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Spain 


Names of Languages 
The languages are: 


e English 
e Afrikaans. 


Descriptions of the Languages 


South Africa uses two languages, English and Afrikaans. Originally, Afrikaans 
was a variation of the Dutch spoken by the Boers who settled in South Africa in 
the seventeenth century. Now, the language has assimilated other languages to 
become a separate language, not just a variation of Dutch. 


Both languages are written left-to-right, top-to-bottom. 


Unique Characteristics of the Languages as Used in South Africa 
South Africa generally uses the United Kingdom spellings of English words. 
Percentage of the Population That Uses the Languages 


e English 50% 
e Afrikaans 50%. 


Alphabet Characteristics 
Both English and Afrikaans use the Latin alphabet. 
Support of More than One Language 


South Africa requires a common character set for English and Afrikaans. 
Keyboards require nonescaping (dead) keys input for accented characters. South 
Africa requires separate linguistic aids (dictionary, spelling aid, and so on) for 
English and Afrikaans. 


Spain (Espafia) is an European country bordering on the Atlantic Ocean, France, 
Andorra, the Mediterranean Sea, and Portugal. It shares the Iberian Peninsula, 
the westernmost part of mainland Europe, with Portugal. The capital is Madrid. 
The population of Spain is 39 million. 
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Names of Languages 
The languages are: 


Castillian 
Catalan 
Gallego 
Basque. 


Descriptions of the Languages 

Castillian is the language commonly known as Spanish. Catalan is a language 
mainly used in eastern Spain (Catalonia). Gallego is a language mainly used in 
northwestern Spain (Galicia). Basque is a unique language, unlike any other 
European language, and is spoken in the northeast of Spain, along the border 
with France. 

All four languages are written left-to-nght, top-to-bottom. 

Unique Characteristics of the Languages as Used in Spain 


Castillian is spoken in Central and South America although some words used in 
Spain are not used in Central and South America. 


An accent can change the meaning of a word. For example: 
‘‘Mascara” means “(He) will chew.” 


“Mascara” means “Mask.” 


Percentage of the Population That Uses the Languages 


e § Castillian 100.0% 
e 6 Catalan 25.0% 
e Gallego 7.5% 

e Basque 6.0%. 


Note: All Spaniards use Castillian, and some also use Catalan, Gallego, or 
Basque. 


Alphabet Characteristics 
All languages use the Latin alphabet. 
Support of More than One Language 


All four languages are supported in the standard keyboard and character set and 
so can be represented on displays and printers. 


Support of Castillian is required for customer setup information (CSU), machine 
readable information (MRI), documentation, and linguistic sensitive functions 
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Sweden 


Switzerland 


(LSF), such as Spanish dictionaries, spelling aid, automatic hyphenation, and 2 
synonym aid. Support of Catalan is required for LSF such as dictionaries, 

spelling aid, automatic hyphenation, and, at a lower priority, synonym aid. MRI 

enabling for Catalan, Gallego, and Basque is part of the global MRI language 

enabling. Gallego and Basque also require LSF enabling. 


Sweden (Sverige) is a Scandinavian country bordering on Norway, Finland, and 
the Baltic Sea, which separates Sweden on the southwest side from Denmark. Its 
capital is Stockholm. The population of Sweden is 8 million. 

Name of Language 

The language is Swedish. 

Description of the Language 

Swedish is written left-to-nght, top-to-bottom. 

Percentage of the Population That Uses the Language 

e Swedish 100%. S 
Alphabet Characteristics 

Swedish uses the Latin alphabet. The letters 4, A, 4, A, 6, and O are an integral 


part of the alphabet and are not considered to be accented letters. In the 
alphabet sequence, these letters follow the letter z. 


Switzerland (Schweiz, Suisse, or Svizzera) is a European country bordering on the 
Federal Republic of Germany, Austria, Italy, and France. The capital is Bern. 
The population of Switzerland is 6 million. 


Names of Languages 
The languages are: 

e Swiss German 

e Swiss French 


e Swiss Italian 


Another language, called Rhaeto-Romansch, is spoken in areas adjacent to the 
Italian border by a small minority in several dialects. ; 
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Thailand 


Descriptions of the Languages 

Swiss German is the language most in use. French and Italian are also used. 
All three languages are written left-to-right, top-to-bottom. 

Unique Characteristics of the Languages as Used in Switzerland 

Swiss German is slightly different from German. For example, the sharp s (B) 1s 
not used in Swiss German, and several words have different genders in Swiss 


German. 


There are only minor differences between Swiss French and French and between 
Swiss Italian and Italian. 


Percentage of the Population That Uses the Languages 
Swiss German 65% 
Swiss French 18% 


e 
e 
e Swiss Italian 10% 
e Others 71%. 
Alphabet Characteristics 


All languages use the Latin alphabet. 


Thailand (Prathet Thai), formerly known as Siam, is in Southeast Asia. The 
capital is Krung Thep (known as Bangkok to English speakers). The population 
of Thailand is 52 million. 

Names of Languages 


The official language is Thai. English is also widely used. Chinese is used 
occasionally. 


Description of the Language 
Thai is a phonetic language containing consonants, tone marks, vowels (lower, 
middle, and upper), and numerals. Thai has its own numerals, but Arabic 


numerals are used for data processing. 


Vowels are classified as upper, middle, and lower vowels depending on the 
printing position on the print line, in relation to the consonant character. 


Thai is written left-to-right, top-to-bottom. 
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Turkey 


Percentage of the Population That Uses the Languages 


The following table shows the percentage of languages used in business because 
these values are not necessarily reflected in the percentage of the total population 
that uses each language. 


Local International 
Government Banking Commerce Commerce 


English 30 60 40 80 
Thai 70 35 60 20 
Chinese 0 5 0 0 


Alphabet Characteristics 


Thai alphabets are used to form words similar to the way the Latin alphabet 1s 
used to form words. One word can consist of one to five syllables. Syllables are 
composed using consonants, vowels, and tone marks. Middle vowels are always 
at the same printing height as consonants. Upper vowels, lower vowels, and tone 
marks are used to compose a word or syllable in slim form. Words are run 
together using proportional space, so there is no break (blank) between syllables, 
or even words. The only breaks come at the end of sentences and to separate 
foreign words. 


Turkey (Turkiye) is partly European and partly Asian. It borders on Bulgaria, 
the Black Sea, the USSR, Iran, Iraq, Syria, the Mediterranean Sea, and Greece. 
The capital is Ankara. The population of Turkey is 52 million. 

Name of the Language 

The language is Turkish. 

Description of the Language 

Turkish is a unique language, which is also spoken in parts of: 

Tran 

Greece 


Bulgaria 
The USSR. 


Percentage of the Population That Uses the Language 
¢ Turkish 100%. 
Alphabet Characteristics 


Turkish uses the Latin alphabet. 
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United Kingdom 


The United Kingdom (UK) consists of England, Scotland, Wales, and Northern 
Ireland. The capital is London. The population of the UK is 56 million. 


Names of Languages 


The primary language is English, although Welsh is spoken as the first language 
in some parts of Wales. 


Description of the Language 


English is a Germanic language that uses the Latin alphabet. 


Unique Characteristics of the Language as Used in the UK 


The UK uses spellings different from US English spellings for some words. 


Percentage of the Population That Uses the Language 
© English 100%. 
Alphabet Characteristics 


English uses the Latin alphabet. 


United States of America 


The United States of America (USA) consists of parts of North America and the 
Hawauan Islands. The USA is largely bounded by Canada in the North and by 
Mexico in the South, with the state of Alaska bounded by Canada in the South 
and to the East. The capital of the USA is Washington D.C. The population of 
the USA is 236 million. 

Names of Languages 


The official language is English, although Spanish is spoken by some segments of 
the population. 


Description of the Language 


English is a Germanic language that uses the Latin alphabet. 


Chapter 2. National Language Descriptions 2-25 


Yugoslavia 


| 
Unique Characteristics of the Language as Used in the USA 2 


UK English and USA English use different spellings for some words. 


Percentage of the Population That Uses the Language 
e English 100%. 
Alphabet Characteristics 


English uses the Latin alphabet. 


Yugoslavia (Jugoslavija) is a southeast European country bordering on Austria, 
Hungary, Romania, Bulgaria, Greece, Albania, the Adriatic Sea, and Italy. The 
capital is Beograd (known as Belgrade to English speakers). The population of 
Yugoslavia is 22 million. 


Names of Languages 


The languages are: | ) 


e Serbo-Croatian 

e Slovenian 

e Macedonian 

e Albanian 

e Hungarian and others. 


Descriptions of the Languages 


Serbo-Croatian, Slovenian, and Macedonian are Slavic languages. 
Serbo-Croatian is a spoken language that is written two ways: Serbian uses Latin 
and Cyrillic characters, while Croatian uses only Latin characters. Both are 
written left-to-right, top-to-bottom. 


e Serbian 45.0% 

e Croatian 27.0% 

e Albanian 9.0% 

e Slovenian 7.7% 

e Macedonian 7.7% 

e Hungarian 2.5% 

e Others 1.1%. 

Alphabet Characteristics 2 


Serbian and Macedonian use Cyrillic characters. All others use Latin characters. 
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Chapter 3. Technical Characteristics 


This chapter describes technical information on: 

e Sort sequences, monocasing, and character classification 
e Keyboards 

e Currency 

e Number Rounding 

e Number Notation 

e Calendars 

e Format of the date and time 


e Weights and measurements systems. 


Sort Sequence, Monocasing, and Character Classification 


This section describes how different countries use the alphabet characters 
belonging to their country language(s). In particular, this section provides 
detailed information on: 


e The sequence used by each country to sort in ascending order 
e Converting to single case (also known as monocasing) 
e Using special characters as alphabetic or numeric. 


The information on sort sequences, monocasing, and character classification is 


character set independent and applies equally to all the code pages (such as 
EBCDIC and ASCII) of a given country. 
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Sort Sequence Considerations 


Figure 3-1 shows a common ascending-order sort sequence. A note in the 
character tables of countries using this sequence will refer to Figure 3-1. Other 
countries use a different sort sequence, which is shown in their character tables. 


iN 
Spaces 
Characters given a sort weight between spaces and alphabetic 
characters (listed for each country) 


Alphabetic characters (sorted as shown in the following tables) 
Numeric characters (sorted in numeric sequence) 

Special characters (sorted by code point hexadecimal or binary 
sequence). 


Figure 3-1. General Sort Sequence 


Characters with Sort Weight between Spaces and Alphabetic Characters 


Latin Alphabet Number 1 


Figure 3-1 indicates that some special characters are given a “sort weight between 

spaces and alphabetic characters.” These characters help decide the position of 

where the sorted data ends after the sort operation. They can be in a binary or 

hexadecimal sequence among themselves, but ultimately their final sort weight 1s 

between spaces and alphabetic characters. The following are samples of these 2 
special characters: 


e The hyphen in “Wynne-Roberts” will cause this name to be sorted just after 
“Wynne Rogers.” 


e The apostrophe in O’Brian will cause this word to be sorted ahead of any 
other OBrian that does not have the apostrophe. 


e The Portuguese character string “De” at the beginning of a sort field and 
followed by a space in “De Souza” will cause this name to be sorted as if it 
was written “Souza.” 


Latin Alphabet Number 1 (LAN1) is an ISO character set used for English, 
German, French, and numerous other languages. LANI may be a superset of 
the character sets specified in some countries’ sort sequences. If this is so, the 
LANI characters not contained in such a sort sequence could be handled in a 
consistent manner as follows: 


1. In the case where the regular nonaccented character is present on the country 
sort sequence, but accented characters or special forms of the character are 
not present, then these latter ones should follow the regular nonaccented 


character. ) 
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C 2. Inthe case where neither the regular nonaccented character, nor the accented 
characters or special forms of the character are given on the country sort 
sequence, then such characters should be placed according to the Multilingual 
Code Page (MLP) sort sequence. 


MLP Sort Sequence 


The Multilingual Code Page (MLP) sort sequence is the one used by FRG and 
Switzerland. Both of these countries have the same sort sequence, which is based 
on the MLP. 


Some other countries have indicated that they could use an alternative sort 
sequence based on the MLP, the one used by FRG and Switzerland. But, this 
alternative sort sequence is not to be regarded as a substitute for these countries’ 
main sort sequence. 


Sort Exits 


Sort programs need to provide user sort exits to allow for customizing and 
support of countries that have alternative sort sequences. 


Monocasing Considerations 


The monocase rules given for the various countries are to be fullowed wheii 
providing the Caps Lock function. This function is defined on page 3-90. 


Character Classification Considerations 


The classification of special characters into alphabetic characters and numeric 
characters does not apply to sort sequences. It applies to software programs that 
have to determine if a string of data is alphabetic or numeric. Such a 
classification 1s more commonly required after the user has manipulated the string 
of data. For example, the user has entered data in a screen input field containing 
currency figures, and the program has defined this field to contain only numeric 
data. 


Character Table Headings 
The following character tables show the alphabetic-character sort sequences for 
the countries, monocasing information, and character classification. Figure 3-2 is 
an example of the heading for the tables, as follows: 
e The sequence number (item A in Figure 3-2) 


e The graphic character (item B) 


e The Character Identifier (item C) used in IBM Corporate Standards, which 
i follows closely the character identifier used in ISO 6937/2, Annex D 


e The monocase character, if only uppercase is used (item D) 


Chapter 3. Technical Characteristics 3-3 


The character classification (A is alphabetic, N is numeric, and § is special) J 
(item E) 


e Notes defined with each table (item F). 


Seq Char ID Mono Alpha Note 


Num case /Num 
(A) (B) (C) (D) (—) _ (F) 


Figure 3-2. Character Table Heading 
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Cc Albania 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required or syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table 


Figure 3-3 shows the character table for Albania. The table lists the characters in 
ascending order with the lowest-weighted character listed first. 
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Seq Char ID 


Mono Alpha Note 


Num case /Num 
l a LAO] LA02 A 
l A LAO2 A 
2 b LBO1 LBO2 A 
2 B LBO02 A 


a 


A 
¢ LC41 Lc42 A 
C LC42 A 
d 


LDO1 LD0O2 A 
LD02 


an 
s) 
> 


6 e LEO! LEO2 A 
6 BE LE02 A 
7  €&  LEI7 LEB A 
7  £E LEIS A 
8 f  LFOl LFO2 A 
8 F  LFO2 A 
9 g  LGOl LGO2 A 
9 G_ LG02 A 
10 h  LHOl LHO2 A 
10 H  LH02 A 
11 i  LI0l Lio2 A 
11 I Lio2 A 
12 j Lol Loz A 
12 J Lyo2 A 

k LKOl LKO2 A 
13 K  LK02 A 
14 1 LLOL LLO2 A 
14 L  LLO2 A 
15 m  LMOl LM02 A 
15 M_ LMo2 A 


Figure 3-3. Character Table for Albania 


Seq Char ID 


Mono Alpha Note 


Num case /Num 
16 n LNO! LNO2 A 
16 N  LNO2 A 
17 fe) LOO! LOO02 A 
17 O LOO02 A 
18 P LPO! LP02 A 
18 P LP0Q2 A 
19 q LQ01 LQ02 A 
19 Q = LeQ02 A 
20 re LRO!1 LRO2 A 


20 R  LRO2 A 
21 § LSO1 LS0O2 A 
21 S LS02 A 
22 tt LTO!1 LYT02 A 
22 T  LT02 A 
23 u LUO! LU02 A 


23 U LU02 A 


24 Vv LV0O1 LV02 A 
24 Vv =LV02 A 


25 w  LWOI LW02 A 
25 Ww LW02 A 


26 x LX01 LX02 A 
2 X  LXx02 A 
27 +y ~~ LYOl LY02 A 
27 Y  LyY02 A 
28 z  LZ0l LZ02 A 
22 Z LZ02 A 


All other characters are sorted as 


shown in Figure 3-1 on page 3-2. 
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C 


Argentina 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “‘s’havia’’) (SPOS). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the start of a sort field and if they are 
immediately followed by a space: 


e d’, D’, de, De, del, and Del 
e el and El 
e 1’, L’, la and La. 


Character Table 


Figure 3-4 shows the character table for Argentina. The table lists the characters 
in ascending order with the lowest-weighted character listed first. In Argentina, 
some characters are regarded as equivalents and are listed in the table with equal 
weights. After a sort, these characters can be intermixed like, for example, the 
following words: 


Unsorted list Sorted list 
candor candido 
casa candor 
Cantera cantaro 
Castro Cantera 
candido casa 
cantaro cascara 
cascara Castro 
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Seq Char ID Mono Alpha Note 


Seq Char ID Mono Alpha Note 
Num case /Num 


Num case /Num 


] a LAO! LAQ2 A 11 ] LJOl LJ02 A 

| A LAQ2 A 11 J LJ02 A 

l a LAI! LAI2 A 12 k LKOl LKO2 A 

l A LAI2 A 12 K  LK02 A 

| a LAI3 LAI4 A 13 LLO! LLO2 A 

l A LAI4 A 13 L  LLO02 A 

2 b LBO! LBO2 A 14 ll A 2 

2 B  LBO2 A 14 Ll A 2 
14 LL A 2 


15 m LMOl LM02 A 
15 M LM02 A 


16 n LNOI] LNO2 A 
16 N  LN0O2 


> 


3 Cc LCOl LC0O2 A 

3 C LC02 

4 ch l 
4 Ch 1 
4 CH l 
5 d LDO! LDO2 A 

5 D LDO2 A 


>> > 


LN19 LN20 
17 LN20 


6 LEO! LE02 A Oo LOO! LO0O2 A 
6 E  LE02 A 18 O LO02 A 
6 : LE11 LE1I2 A 18 fe) LOI] LO12 A 
6 LE12 A 18 O LOl12 A 
6 : LE13 LE14 A 0) LO13 LOl4 A 
6 LE14 A O LOl4 A 
7 


q LQOl LQ02 A 
2 «Q ~~ LaQ02 A 


9 h LHO1 LH02 A 21 r LRO1] LRO2 A 

9 H  LH02 A 21 R  LRO2 A 
1 LIO1 = LI02 S 

10 I LI02 S 


22 LSO] LSO2 A 
22 LS02 A 
23 


LYTO1 LT02 A 
LT02 


L115 =LI16 


>> >> 


t 
23 T A 


Figure 3-4 (Part 1 of 2). Character Table for Argentina 
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Seq Char ID Mono Alpha Note 
Num case /Num 


24 20 LUO! LU02 A 
U- LU02 A 


24 u LU11 LU12 A 
24 U LU12 A 
A 
A 


Seq Char ID Mono Alpha Note 
Num case /Num 


27 X LX01L LX02 A 
27 x  LX02 A 


28 y  LYOl LYO2 A 
28 Y  LY02 A 


29 Z LZOl LZ02 A 
29 Z  LZO02 A 


All other characters are sorted as 


f 
=e 


shown in Figure 3-1 on page 3-2. 


24 i LUI7 LUIB8 
U _LUI8 


25 Vv LVO! LV02 A 
25 VY LV02 A 


26 w  LWO01 LW02 A 
26 Ww LW02 A 


Figure 3-4 (Part 2 of 2). Character Table for Argentina 

Notes for Figure 3-4: 

1. The combinations ch, Ch, and CH are printed, displayed, and otherwise processed as two characters. 
During sorting, each combination is processed as one character that follows all other c characters. 
For example, the name Cordoba is sorted ahead of Chavez. 

2. The combinations ll, Ll, and LL are printed, displayed, and otherwise processed as two characters. 
During sorting, each combination is processed as one character that follows all other | characters. 


For example, the name Lépez is sorted ahead of Lierena. 


3. Argentina’s currency is now the austral (a capital A with two horizontal bars). 
It does not replace the dollar symbol! ($) in Argentina’s character set. 
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Austria 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table ) 


Figure 3-5 shows the character table for German as used in Austria. The table 
lists the characters in ascending order with the lowest-weighted character listed 
first. In Austria, some characters are regarded as equivalents and are listed in the 
table with equal weights. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


1 a LAO! LAO2 A 1 
1 A Lao A 4 
1  & LAI7 LAI8 A_ 2 
i A LAI A 4 
2 b  LBOl LBO2 A 
2  B- LB02 A 

c LCOl LCO2 A 

C  Lc02 A 


& [ W& Go 


a 
when 
> 


LDO1 LDO2 A 
LD02 


5 e LEO1 LEO2 A 
5 E  LEO02 A 
6 f LFO!1 LF02 A 
6 F LF02 A 
Z LGO1 LG02 A 
G  LG02 A 


ooj~I~ 


h LHO!] LH0O2 A 
H  LH02 


A 
1 L101 LI02 A 
9 I LI02 


A 

10 j LJO1 LJO2 A 

10 J LJQ2 A 
k 


LKO!1 LK0O2 A 


\O | co 


1] K  LKO2 A 
12 ] LLOl LLO2 A 
12 L  LL02 A 
13 m LMO!l LM02 A 
13 M LMo02 A 
14 n LNO1 LNO2 A 
14 N  LNO2 A 
15 O LOO! LO0O2 A 


15 O LO02 A 


Figure 3-5. Character Table for Austria 


Seq Char ID Mono Alpha Note 


Num case /Num 
15 6 LO17 LOI8& A 2 
15 O LO18 A 4 
16 p LPO1 LP02 A 
16 P LP02 A 
17 q LQ01 LQ02 A 
17 Q = LQ02 A 
18 r LRO1 LRO2 A 
R  LRO2 A 


S LSO1 
LS02 


LS61 


[S02 A 
A 


A 


LS61 


LTO1 
LT02 


LUO1 
LU02 


LU17 LU18 
LU18 


V LVO1 LV02 A 
V LV02 A 


w  LWO!1 LW02 A 
Ww LW02 A 


x LX01 LX02 A 
x  LX02 A 


LY0O1 LY02 A 
LY02 A 


LZ01 LZ02 A 
LZ02 A 


LT02 A 
A 


LU02 A 
A 


A 
A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
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\ 
Notes for Figure 3-5: Em 


1. For sorting purposes, capital and small letters have equal weight. 
2. The same equivalence applies to letters with diaeresis: 

® aand A are sorted as a, 

¢ 6 and O are sorted as o, and 

e ii and U are sorted as u. 


3. B (LS61) 1s sorted as ss. 


4. Words with original spelling are sorted ahead of words that become equal 
after modifications. 


e Band becomes band, but follows original band. 


e Hauser becomes Hauser, but follows original Hauser. 
e¢ MaBe becomes Masse, but follows original Masse. 
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Belgium 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “l’amour”) (SP05). 


The following special characters are considered numeric: 


All superscripts 

Minus (SA00), plus (SAO1), and plus-or-minus (SA02) 
Less than (SA03) and greater than (SA05) 

Divide (SA06) and multiply (SA07) 

Number (SM0O1), percent (SM02), and asterisk (SM04) 
Left and right brackets (SM06 and SM08) 

Left and nght braces (SM11 and SM14) 

Left and right parenthesis (SP06 and SP07) 

Comma (SP08) and period (SP11) 

Slash and backslash (SP12 and SMO07) 

Colon (SP13). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table 


Figure 3-6 shows the character table for Belgium. The table lists the characters in 
ascending order with the lowest-weighted character listed first. In Belgium, some 
characters are regarded as equivalents and are listed in the table with equal 
weights. This means that, after a sort, these characters can be intermixed like, for 
example, the following words: 


Unsorted list Sorted list 
essen Eingang 

et enumeration 
Eingang epoque 
epoque essen 
enumeration et 

etoile étoile 
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Seq Char ID Mono Alpha Note 
Num case /Num 


Seq Char ID Mono Alpha Note 
Num case /Num 


12 h LH01 LH02 A 
12 H  LH02 A 


rr Prrrrrry 


, LAI7 LAI8& A 
A LAI8 


14 1 4Li6l Lo2 A 

16 if LISI A 48 
16 j LJol Loz A 

16 J Lyo2 A 

17 k  LKOl LK02 A 
17 K  LK0o2 A 


18 l LLOI LLO2 A 
18 L  LL02 A 


19 m LM01 LM02 A 
19 M LM02 A 


WwW Leo 
ro) 


20 on LNOL LNO2 A 

20 N_ LNO2 A 

6  LD63 LD62 A 21 &  LNI9 LN20 A 

HD LD62 A 21 N  LN20 A 

9 e LEO! LEO2 A 22 +o LOO! LOO2 A 
9 E LE02 A 22 ~O A 
9  é€ LEll LEI2 A 22. «6 A 
9  E LEI2 A 22 =O A 
9 é&é LEI3 LEI4 A 22. «6 A 
9 E LEI4 A 2 OO A 
9  @ LEIS LEI6 A 22 «6 A 
9 E  LEI6 A 2 20 A 
9  é& LEI7 LEI8 A 22 6 A 
B LEI8 A 22 O A 

10 f  LFOl LFO2 A 6 A 
; : 
11g LGOl LGO2 A 2 A 
11 G  LG02 A DQ A 


Figure 3-6 (Part | of 2). Character Table for Belgium 
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Char ID Mono Alpha Note 
case /Num 


Seq Char ID Mono Alpha Note 
Num case /Num 


24 ce LO51 LOS52 A 5,8 LUIS LU16 

24 C LOS52 A 5,8 LU16 

25 p LPOl LP02 A a LUI7 LUI8 A 
25 P  LP02 A U LU18 A 
26 q LQ01 LQ02 A Vv LVO1 LV02 A 
26 Q  =LQ02 A Vv LV02 A 
27 r LRO! LRO2 A w  LWO1 LW02 A 
27 R  LR02 A Ww LW02 A 
28 § LSO!1 LSO2 A xX LX01 LX02 A 


LS02 xX  LX02 A 


LS61 


A 
A 


28 


LS61 


t LTO1 LT02 A 
30 T  LT02 A 


31 pb LT63 LT64 A 
Bb LT64 


LZ01 LZ02 A 
LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-6 (Part 2 of 2). Character Table for Belgium 
Notes on Figure 3-6: 


1. This sort sequence is based on the Latin Alphabet Number | character set. Explanations 
at the beginning of this chapter deal with products supporting other character sets. 


2. as sorted as aa. 

3. aand @ are sorted as ae. 

4, ij ligature is sorted as two letters, i followed by j. 
5. 6,@ and @ are sorted as oe. 

6. B (LS6]1) is sorted as ss. 

7. tis sorted as ue. 


8. ij and @ are not part of the Latin Alphabet Number | character set, but they are included 
here because they are part of the Dutch language (ij) and the French language (cc). 
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Bulgaria 
Character Table 


Figure 3-7 shows the Cyrillic-character table for Bulgaria. The table lists the characters in ascending order 
with the lowest-weighted character listed first. 


Seq Char ID 


Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num Num case /Num 
l a KAO! KA02 A 17 p KRO! KRO2 A 
l A  KA02 A P KRO02 
2 6 KBO] KBO2 A KS01 KS02 
2 b KBO02 A KS02 
3 B KV0Ol KV02 A T KT01 KT02 
3 B KV02 A 19 T  KT02 A 
4 r KG0O1 KG02 A 20 y KU01 KU02 A 
4 T KG02 A 20 VY KU02 A 
21 om KFOl KFO2 A 
| 21 @D KF02 A 
22 X KHO!l KH02 A 
KE02 22 xX KHO02 A 
7 "” =©6. KKZ21 KZ22 =A 23 Ll KCOl1 KC02 A 
7 4K KZ22 A 23 Ii KC02 A 
8 3 KZ01 KZ02 A 24 Y KC21 KC22 A 
8 3 KZ02 A YW -K@€22 
9 Mi KIO! KI02 A wm KS21 KS22 
9 MW KIO02 A 25 UI KS22 A 
10 vi KJi1 KJ12 A 26 um KSI5 KS1l6 A 
10 Me KJl2 26 lil KS16 A 
11 K KKOl KKO02 A 27 b KU21 KU22 A 
11 K KK02 A 27 “"'b KU22 A 
12 I KLO1 KLO2 A 28 b KX11 KX12 A 
12 JI KLO02 A 28 b KX12 A 
13 mM KMO0l KM02 A 29 ro KUI5 KUI6 A 
13 M KM02 A 29 IO KUI16 A 
14 H KNO1 KNO2 A 30 A KAI5 KAI6 A 
14 H KNO2 A A  =6©KAI6 
15 re) KOO! KOO2 A 
15 QO KO02 A All other characters are sorted as 
16 i KP01 KP42 A shown in Figure 3-1 on page 3-2. 


16 IT KP02 


Figure 3-7. Character Table for Bulgaria 
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Canada 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required) (SP30) 

e Hyphen (required and syllable) (SP10 and SP32) 

e Apostrophe (if it has a letter ahead of and behind it, as in “l’amour” and 
“don't”) (SPOS). 


The following special characters are considered numeric: 


Comma (SP08) and period (SP11) 

Plus (SA01) and minus (SA00) 

Space (numeric) (SP31) 

Left and night parenthesis (SP06 and SP07). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


¢ Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table 


Figure 3-8 shows the character table for Canada. The table can be used for both 
Canadian French and Canadian English. The table lists the characters in 
ascending order with the lowest-weighted character listed first. In Canada, some 
characters are regarded as equivalents and are listed in the table with equal 
weights. After a sort, these characters can be intermixed like, for example, the 
following words (in Canadian French): 


Unsorted list Sorted list 
étre entreprise 
espace enumeration 
entreprise époque 

et espace 
époque et 
énumeration étoile 

étoile etre 


Both in Canadian English and Canadian French, the uppercase and the lowercase 
of a letter (for example, a and A) have equal weights. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


LA0L LAQ2 A 


Seq Char ID Mono Alpha Note 
Num case /Num 


9 i LIi7 = LIl8 A 


LA02 A 9 I LI18 A 
LA13 LAl4 A 10 j LJOl LJ0O2 A 
LA14 A 10 3J LJO2 A 
LA15 LAI6 A 11 k LKO1 LK02 A 


A LAI6 A LK02 A 


b LBO! LBO2 A LLO1 LLO2 A 

B LBO2 A 12 L  LLO02 A 
3 c LCOl LCO2 A 13 m LMO0O!l LM02 A 
3 C LC02 A 13 M LM02 A 
3 ¢ LC41 LC42 A 14 n LNOL LNO2 A 
3 GC LC4 A 14 N  LNO2 A 
4 y Oo LOO! LOO2 A 


15 O LO02 A 
LOI5 LOI6 A 


d LDO1 LDO2 A 
4 D  LD02 A 


LEO] LEOQ2 A 


5 6 
5 LE02 A {15 OO  LO16 A 
5 LE11 LE12 A 16 p  LPOL LPO2 A 
5 LE12 A 16 P  LP02 A 
5 LE13 LE14 A 17 q  LQO1 LQ02 A 
5 LE14 A 17 Q  LQo2 A 
5 LEIS LE16 A 18 +r  LROl LRO2 A 
5 LE16 A 18 R  LRO2 A 
5 LE17 LEIS A 19 s  LSO1 LS02 A 
5 LE18 A 19 S  LS02 A 
6 


t LTO! LYT02 A 
20 T  LT02 


LFOl LFO2 A 
6 LFO02 A 
7g  LGOl LGO2 A 
7 G_ LG02 A 


> 


h LHO1 LHO2 A 
H  LHO02 


LI01 LI02 
LI02 
Lis =LI16 
LI16 


8 
8 


>>> > DD DD 


Figure 3-8 (Part | of 2). Character Table for Canada 
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Seq Char ID Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num Num case /Num 
22 V LV0O1 LV02 A 26 Z LZ01 LZ02 A 
22 VV LV02 A 26 Z LZ02 A 


23 w LWOI LW02 A 
23 Ww LW02 A 


24 x% LxX01 LX02 A 
24 X  LX02 A 


25 y LY01 LY02 A 
25 y  LY02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-8 (Part 2 of 2). Character Table for Canada 
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Czechoslovakia 
Character Table 
Figure 3-9 shows the character table for Czechoslovakia. The table lists the characters in ascending order 


with the lowest-weighted character listed first. In Czechoslovakia, some characters are regarded as 
equivalents and are listed in the table with equal weights. 


Seq Char ID Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num Num case /Num 
1 a LAOL LAO2 A 17 i Liti Lil2 A 
1 A LA02 A 17 { LhH2 A 
2 a LAlIl LAI2 A 18 j LJOL LJo2 A 
2 A LAI2 A ig J LJo2 A 
3 4 LAI7 LAI8& A 19 k  LKO! LKO2 A 
3 A LAI8 A 19 K_ LK02 A 
4b  LBOL LBO2 A 20 1 £LLOl LLO2 A 
4 B  LB02 A 20 L_ LLO2 A 
5 c LCOl LCO2 A 21 +%|{ LL LLI2 A 
5 C LC02 A 21 L  LLI12 A 
6 é  LC21l LC22 A 22 %I  LL21 LL A 
6 C LC22 A 22 L_ LL22 A 
7 d LDO! LDO2 A 23 m LMO0! LM02 A 
i D  LD02 A 23 M_ LMO02 A 
8 d LD21 LD22 A 24 on LNOI LNO2 A 
8 D LD22 A 24 N_ LNO2 A 
9 e LEO! LEO2 A 25 ft  \LN21 LN22 A 
9 E  LE02 A 25 N_ LN22 A 
10 é€ LEI LEI2 A 26 o LOO! LOO2 A 
10 E LEl2 A 26 O- LO02 A 
11 & LE21 LE22 A 27 +6 \LOLl LOI2 A 
11 E  LE22 A 27 O- LO12 A 
12 f LFOl LFO2 A 28 6 #$\LOIS LOI6 A 
12 F  LFO02 A 2 OO LOI6 A 
13. +g  LGOl LGO2 A 


G LG02 A 


29 6 LOI7 LOI8 A 
22 OG LOIB8 A 
30 


LHOl LHO2 A p LPO! LPO2 A 
LH02 30 P  LPo2 A 

31 q  LQOl LQO2 A 

31 Q LOO A 

i LIO1 LI02 32. +r  LROl LRO2 A 


LRO2 A 


Figure 3-9 (Part 1 of 2). Character Table for Czechoslovakia 
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Mono Alpha Note Mono Alpha Note 


Seq Char ID 


Seq Char ID 


Num case /Num Num case /Num 
33 t LR11 LR12 A 42 a LU17 LUI8 A 
33 R  LR12 A 42 U LUI18 A 
34 t LR21 LR22 A 43 V LVOl LV02 A 
R  LR22 A 43 Vv LV02 A 
44 Ww LWO0!1l LWO02 A 
44 W LW02 A 
36 y LS21 LS22 A 45 x LX01 LX02 A 
36 S LS22 A 45 x LX02 A 
37 t LTO1 LT02 A 46 y LY0O1 LY02 A 
37 T LT02 A 46 Y LY02 A 
38 t LT21 LT22 A 47 y LY11 LY12 A 
38 T LT22 A 47 Y LY12 A 
39 u LUO! LU02 A 48 Z LZ01 LZ02 A 
39 U  LU02 A 48 Z  LZ02 A 
40 u LU11 LU12 A 49 Z L721 LZ22 A 
40 U LU12 A 49 Z LZ22 
41 U LU27 LU28 A All other characters are sorted as 


U LU28 A shown in Figure 3-1 on page 3-2. 


Figure 3-9 (Part 2 of 2). Character Table for Czechoslovakia 
Notes for Figure 3-9: 


1. The combinations ch and Ch are printed, displayed, and otherwise processed as two characters. 
During sorting, each combination is processed as one character that follows the h and H characters. 
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Denmark 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32). 
Character Table 


Figure 3-10 shows the character table for Denmark. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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2 


Seq Char ID Mono Alpha Note 
Num case /Num 


LBO! LBO2 A 


LCOl LCO2 A 
8 LC02 A 
LDO1 LDO2 A 


LE11 LE12 A 
LE13 LE14 A 
LE15 LE16 A 
LE17 LE18 A 


15 LFO!1 LF0O2 A 


E 
E 
EB 
f 


Seq Char ID Mono Alpha Note 


Num case /Num 
21 1 L101 Li02 A 
Lili LI{12 


1 
22 i LM3 Lil4 A 
22 1 LMS LiH6 A 


LIW7 =LIT18 


LKO1 LKO2 A 


LK02 A 
29 ] LLO1 LLO2 A 


L  LL02 


M LMO02 


33 n LNO! LNO2 A 
34 N  LNO2 A 


35 re) LOO! LOO2 A 


36 Oo LO1l LOI2 A 
36 Oo LO13 LO14 A 
6 LOIS LOI16 A 


Figure 3-10 (Part 1 of 2). Character Table for Denmark 
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3-23 


Seq Char ID Mono Alpha Note 
Num case /Num 


50 u LUI1 LUI2 A 
50 u LUI3 LUI4 A 
50 a LUIS LUI6 A 


60 y 
60 y 
61 ui 


LY11 
LY17 
LU17 


LY12 A 
LY02 A 
LUI8 A 


Figure 3-10 (Part 2 of 2). 


Seq Char ID Mono Alpha Note 
Num case /Num 


75 a LA27 LA28 
76 LA28 


> 


Character Table for Denmark 
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Finland 

Character Classification Considerations 

The following characters are classified as alphabetic: 
Special characters 
Space (required and numeric) (SP30 and SP31) 


Hyphen (required and syllable) (SP10 and SP32) 
Apostrophe (when used within words) (SP05). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00) 
Asterisk (SM04). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table 


Figure 3-11 shows the character table for Finland. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


] a LAO! LA02 A 


LCOl] LCO2 A 
LC02 A 


10 e LEO! LE0O2 A 


LE|2 
LE13  LEO2 A 
LE15 LEO2 A 
17 =f LFO! LF0O2 A 

22 4H 

3.01 

4 | 


A 
2 j LJol Ljo2 A 
0 A 
1 
2 


K 
l 


Figure 3-11 (Part 1 of 2). 


Seq Char ID Mono Alpha Note 
Num case /Num 


33 n LNOI] LNO2 A 
34 N  LNO2 A 


35 O LOO! LOQ02 A 


36 O  LO02 A 


37 +p  LPOl LPO2 A 


LQ01 


40 Q. LQ02 A 
42 R_ LRO2 A 
43s  LSOL LSO2 A 


LS02 A 


Character Table for Finland 
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Seq Char ID Mono Alpha Note 
Num case /Num 


65 6 LOI7 LOI8 A 
66 O- LOI8 A 


Seq Char ID Mono Alpha Note 
Num case /Num 


67 0 ND10 N 
68 l NDO! N 
69 2 ND0O2 N 
70 3 ND0O3 N 


Figure 3-11 (Part 2 of 2). Character Table for Finland 
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France 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


@ Required space (SP30) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “l’amour’’) (SP05). 


The following special characters are considered numeric: 


e Numeric space (SP31), 
e Comma (SP08), and 
e Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the beginning of a sort field: 


e d’, and D’ 
e |’, and L’. 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the beginning of a sort field and if they are 
immediately followed by a space: 


e de, De, des, Des, and du 
e la, La, le, Le, les, and Les. 
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Character Table 


Figure 3-12 shows the character table for French. The table lists the characters in 
ascending order with the lowest-weighted character listed first. In France, some 
characters are regarded as equivalents and are listed in the table with equal 
weights. After a sort, these characters can be intermixed like, for example, the 


following words: 


Unsorted list 


espace 
entreprise 

et 

époque 
énumeration 
étoile 


Sorted list 


entreprise 
enumeration 
eépoque 
espace 

et 

étoile 
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Seq Char ID Mono Alpha Note 
Num case /Num 


LAO] LA02 A 
LA02 A 


4  LAI3 LAl4 A 
A LAI4 A 


4 LAIS LAI6 A 
A LAI6 A 
2 LAS! LAS2 A 


NON YS YS 


A 


LB02 A 
LBO2 A 


LCOl] LCO2 A 
LC0Q2 A 


LC41 LC42 A 3 
C Lc4 A 3 


d LDO1 LDO2 A 
D LD02 A 


LEO! LE02 A 
LE02 A 


€ 

E 

é LEI! LEI2 A 
B  LEI2 A 
é 
E 


LBO! 


LE13 LEI4 A 
LE14 A 


LEIS LE16 A 
LE16 A 


é 

E 

é LEI7 LEI8 A 
E LEI8 A 
f 

F 
g 


hE AH KH KAA 


LFO! LFO2 A 
LF02 A 


7 LGO1 LG02 A 
7 G LG02 A 
8 h LHO1 LHO2 A 
8 H LH02 A 
A 
A 


Figure 3-12 (Part 1 of 2). 


S LSOI LSO2 A 

{19 S LS02 A 
20 t LTO1 LT02 A 
T  LT02 A 


Seq Char ID Mono Alpha Note 


Num case /Num 
9 i L117 ~LIl8 A 5 
9 ] LI18 A 5 
10 j 1J01 LJO2 A 

J LJ02 


LKO1 
LKO2 


LKO2 A 
A 


K 


LLO! LLO2 A 
12 L LLO02 A 
13 m LMO! LM02 A 


M LMo02 


n LNOI 
N  LNO2 


LOO1 
LOO02 


LOI5 


A 
LNO2 A 


LPOl] LPO2 A 

16 P  LP02 A 
17 q  LQO1l LQ02 A 
17 Q  LQ02 A 
18 +r  LROl LRO2 A 
R  LRO02 A 


LU17 
LU18 


210 it 


PPrPrr rr PP 


Character Table for France 
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Figure 3-12 (Part 2 of 2). 


22 
23 


25 


Seq Char ID Mono Alpha Note 
Num case /Num 
22 Vv LV0O1 LV02 A 


VV LV02 A 


w LWO1 LW02 A 
23 Ww LWwo2 A 
24 X LX01 LX02 A 
24 xX  LX02 A 
25 y LY01 LY02 A 


Y  LY02 A 


Notes for Figure 3-12: 


l. 


2; 


3: 


4. 


5. 


a and a are sorted as a. 

z is sorted as ae, and @ is sorted as oe. 
¢ is sorted as c. 

é, 6, 6, and é are sorted as e. 

i and i are sorted as 1. 

6 1s sorted as o. 

u, U, and ti are sorted as u. 


y 18 sorted as y. 


Seq Char ID Mono Alpha Note 
Num case /Num 


25 ¥  LYI7 LYI8 A 8 
25 Y  LYI8 A 8 
2 z  LZ01 LZ02 A 
2% Z LzZ02 A 


All other characters are sorted as 


shown in Figure 3-1 on page 3-2. 


Character Table for France 
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German Democratic Republic (GDR) >) 
Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


e Period (SP11) 
¢ Comma (SP08) 
e Plus (SA01) 

e Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


Character Table | ) 
Figure 3-13 shows the character table for German as used in the GDR. The 
table lists the characters in ascending order with the lowest-weighted character 


listed first. In GDR, some characters are regarded as equivalents and are listed in 
the table with equal weights. 


3-32 National Language Information and Design Guide Volume 2 


Seq Char ID Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num Num case /Num 
1 a LAO] LAO2 A 15 QO LOO02 A 
1 A LAQ2 A 15 6 LO17 LOI8 A l 
1 a LAI7 LAI8& A l 15 O LOI8 A l 
1 A LAI8 Al 16 p  LPOl LP0o2 A 
2 b LBO1 LBO2 A 16 P LP02 A 
3 c LCO1l1 LCO2 A 17 Q LQ02 A 
d LDO1 LDO2 A 18 R  LRO2 A 
D  LD0O2 A LSO! LSO2 A 


nib fb 


oo. 
coped boonies, 


LS02 A 
20 8 . LS61 LS6l A 2 
LTO! LT02 A 


wr 


LEO1 LEO2 A 

LE02 A 
LFO!L LFO2 A 
LF02 A 


7 g LGO1 LG0O2 A LUO! LU02 A 
7 G LG02 A 22 U  LU02 A 
8 h LHO! LHO2 A LU17 LUI18 A 
8 H  LH02 A LU18 A 
9 i LI0l LI02 A Vv LVO1 LV02 A 
9 I LI02 A 23 Vv LV02 A 
10 j LJ0O1 LJ02 A 24 w LWO1 LW02 A 

J LJ02 24 Ww LWwo2 A 


LKO1 
LK02 


LK02 A 
A 


25 X LX01 LX02 A 
25 x  LX02 A 


LLO1 LLO2 A 26 y LYO1 LY0O2 A 
12 L LLO2 A 26 Y LY02 A 
13 m o.LMO0O1l LM02 A 27 Z LZO! LZ02 A 
13 M LM02 A Zz LZ02 A 
14 n LNO!L LNO2 A 
14 N LNO2 A All other characters are sorted as 


LOO1 


LO02 A 


15 oo shown in Figure 3-1 on page 3-2. 
Figure 3-13. Character Table for GDR 
Notes for Figure 3-13: 


1. &41s sorted as a, 6 is sorted as 0, and ti 1s sorted as u. 


2. & (LS61) is sorted as ss. 


Chapter 3. Technical Characteristics 3-33 


Germany, Federal Republic of (FRG) 
Character Classification Considerations 
The following special characters are classified as alphabetic: 
@ Space (required and numeric) (SP30 and SP31) 


e Hyphen, Minus Sign (SP10) 
e Syllable Hyphen (SP32). 


The following special characters are considered numeric: 


Period (SP11) 

Comma (SP08) 

Hyphen, Minus Sign (SP 10) 
Plus (SA01). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Syllable Hyphen (SP32) 
e Hyphen, Minus Sign (SP10) 
e Apostrophe (SP05). 


Standards Note: The German Standard DIN 5007 draft Feb 85 provides a 
slightly different sort sequence. 


Character Table 
Figure 3-14 shows the character table for FRG, which is based on the 
Multilingual Code Page (MLP). The table lists the characters in ascending order 


with the lowest-weighted character listed first. In FRG, some characters are 
regarded as equivalents and are listed in the table with equal weights. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


1 a LAO] LAO2 A 1 
A LA02 A 


— 


2 a LAI LAI2 A 
2 A LAI2 A 
2 a  LAI3 LAI4 A 
2 A LAl4 A 
2 a LAIS LAI6 A 
2 A LAI6 A 
2  & LAI7 LAI& A 3 
2 A LAIB A 3 
2  & LAI9 LA20 A 
2 A LA2 A 
2  & LA27 LA2® A 
2 A LA28 A 
2  « LAS! LAS2 A 
2 A 
3 LBO1 LBO2 A 
3 LB02 A 


Oma 


eS 
‘oe 
> 


LCOi LC02 A 
LC02 


5 ¢ LC4 LC4 A 
5 C¢ Lc4a A 
6d LDOl LDO2 A 
6 D_ LD0 A 
7 8 LD63 LD62 A 
7 Bb LDe A 
8 e LEO! LEO2 A 
8 E LEO A 
9 € LEll LEI2 A 
9 6 LBEI2 A 
9 é LEI3 LEI4 A 
9 6 LEI4 A 
9 é@ LEIS LEI6 A 
9 LE16 A 
9 LE17 LEI8 A 
9 A 


g LGO1 
G LG02 


LG02 A 
A 


Seq Char ID Mono Alpha Note 
| Num case /Num 


LI01 
LI02 


LKOI LKO2 A 
16 K  LK02 A 


17 ] LLOI LLO2 A 
L  LL02 A 


m LM01 LM02 A 
M LM02 A 


n LNOL LNO2 A 
19 N  LNO2 A 


20 fh .oLN19 LN20 A 
20 N  LN20 A 


LOO! LO0O2 A 


6 LOllL 

O LO12 A 

6 LOI LOI4 A 

O LO14 A 

6 LOIS LOI6 A 

O LOI6 A 

6 LOI7 LOI8 A 3 
O LOI8 A 3 
6  LOI9 LOZ A 

6 L020 A 

9  LO61 LO62 A 

@ L062 A 


LPO!l LPO2 A 
LP02 A 


eee) 


Figure 3-14 (Part 1 of 2). Character Table for FRG 
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Seq Char ID Mono Alpha Note 
Num case /Num 


24 q LOO LOO A 
24 Q LO02 A 


25 r LROL LRO2 A 


Seg Char ID Mono Alpha Note 
Num case /Num 


31 & LUI7 LUI8 A 
31 U  LUuIs A 


32 V LVO1 LV02 A 


25 R  LRO2 A 32 V_ LV02 A 
26 S LSO1 LSO2 A 33 w  LWO01 LW02 A 
= - LS02 > 33 Ww LW02 A 

LS61_ LS6l 34 X LX01 LX02 A 


34 LX02 A 
35 LY01 LY02 A 


35 LY02 A 


28 +t  LTOl LT02 A 
28 T  LT02 
29 «pb  LT63 LT64 A 
29 +b LT64 


LUO! LU02 r 
LU02 A 


36 y  LYll LYI2 A 
36 «=6pY~——CO@LYY112 A 
y 


36 LY17 LY02 A 


LZ01 LZ02 A 
LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-14 (Part 2 of 2). Character Table for FRG 
Notes for Figure 3-14: 
1. Small letters precede the capital letters, and letters without diacritics precede letters with diacritics. 
2. Equal letters with different diacritics are sequenced according to the Character Identifier. 
3. When the sort is according to the sound of the words as in telephone directories, address lists, then: 
® 4 4is sorted as ae, 6 is sorted as oe, and ti is sorted as ue. 
For other sorts such as dictionaries, encyclopedias, then: 


® 4 is sorted as a, 6 is sorted as 0, and ii is sorted as u. 


4. B(LS61) is sorted as ss. 
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Cc Greece 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required) (SP30) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


e Period (SP11) 
¢ Comma (SP08) 
e Plus (SA01) 

e Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e All special characters. 
Character Table 


Figure 3-15 shows the character table for Greece. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


] A GA02 A 
‘A GAI2 A 
a GA01 GA02 A 


Wid 


GD02 A 
GD0O1 GD02 A 


18 H 
9 


~ 


A 
Z 
t 
t 


7 : A 


0 

Z 

3 
2 
26 


Seq Char ID Mono Alpha Note 
Num case /Num 


‘(O GOl12 


43 If GP02 A 
44 +n GP01 GP02 A 
45 P GRO2 A 


46 p  GROI GRO2 A 
47 ¥  GS02 A 


48 o GSOl1 GS02 A 
49 s GS61 GS02 A 
50 T GT02 A 
53 ‘YY GU12 A 


54 v GU01 GU02 A 


55 0 GU11 GU12 A 
58 @D GF02 A 


~ 


60 GH02 A 
61 GHO!1 GH02 A 


x 
62 W GP62 A 
63 w  GP6l GP62 A 
64 Q  GO32 A 


Figure 3-15 (Part ! of 3). Character Table for Greece 
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Seq Char ID Mono Alpha Note 
Num case /Num 


65 Q =GO72 A 


o GO31 GO32 A 


® GO71 GO72 A 


68 A LAOQ2 A 
70 a LAO] LA02 A 
LA13 LA02 A 
LAI5 LA02 A 


75 wb  LBOL LBO2 A 


A 


LCol 
LC41 


LC0O2 A 
LC42 A 


[84 é LEI LEO? A 
92 H  LH02 A 
95 iLO) L102 A 


Seq Char ID Mono Alpha Note 
Num case /Num 


105 m 
06 A 
107 on 
08 A 
09 A 


K 
L 


11Is = Q = LQ02 A 
116 q LQ01 LQ02 A 
117 R  LRO2 A 
118 =O LROL LRO2 A 
119 «6S LS02 A 
120s LSOL LSO2 A 

T 


27 #6u 
28 


126 ou LU13 LU02 A 
LUIS LU02 A 
l LU17 LUI8& A 


Figure 3-15 (Part 2 of 3). Character Table for Greece 
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Seq Char ID Mono Alpha Note 
Num case /Num 


Figure 3-15 (Part 3 of 3). Character Table for Greece 


Seq Char ID Mono Alpha Note 
case /Num 


N 


N 
N 
N 
N 
N 
N 
N 
N 
N 


3-40 National Language Information and Design Guide Volume 2 


C 


Hungary 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 
Period (SP11) 
Comma (SP08) 


e 
@ 
e Plus (SA01) 

e Miunus (SA00). 

Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required or syllable) (SP10 and SP32). 
Character Table 


Figure 3-16 shows the character table for Hungarian. The table lists the 


characters in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 


Num case /Num 
] a LAO] LAO2 A 
l A LA02 A 
2 a LAI] LAI2 A 
2 A LAI12 A 
3 b LBO1] LBO2 A 
3 B LBO02 A 
4 Cc LCO1 LC02 A 
4 C LC02 A 
5 LDO!1 LDO2 A 


LDO2 


LE01 LE0O2 A 
LE02 A 


: LE11 LEI2 A 
LE12 


LF01 


w~ 
whe 
> 


ON ON 
tri @ 


Co oO] ~) ~]) 
| the 
> >| > 


LF02 A 


LF02 
9 g  LGOl LGO2 A 
9 G_ LG02 A 
10 h  LHOl LHO2 A 
10 H  LHOo2 A 
11 i LIOL Llo2 A 
11 I LI02 A 
12 i Lill Liu2 A 

i Ln2 


j sol LJo2 
13 J Lyso2 A 
14 k  LKOl LKO2 A 
14 K  LKo2 A 
is 1  LLOL LLO2 A 
15 L Lio A 
16 m LMOl LM02 A 
16 ~M_ LMo2 A 
17 nn LNOL LNO2 A 
17 N_ LNO2 A 
18 o  LOOL LOO2 A 


QO LO02 A 


Figure 3-16. Character Table for Hungary 


Seq Char ID Mono Alpha Note 


Num case /Num 
19 re) LO11 LOI2 A 
19 O LOl12 A 
20 6 LO17 LOI18 A 
20 O LOI18 A 
21 oF LO25 LO26 A 
21 O LO26 A 
22 p LPO] LP02 A 
22 P LP02 A 
23 q LQ01 LQ02 A 
23 Q LQO02 A 
24 r LRO1 LRO2 A 


24 R_ LR02 A 
25. s  LSOl LS02 A 
25 $§  LS02 A 
26 +t  LYOl LYo2 A 
2 TT  LT02 A 
27 wu -LUOL LU02 A 
27 U~ Lu02 A 
28 & LUI1 LUI2 A 
28 U  LUI2 A 
29 i LUI7 LUI8& A 
2 U LUIB A 
30 & LU25 LU26 A 
30 GU LU. A 
31 v LVOl Lvo2 A 
31 Vv Lvo2 A 
32. w  LWO01 LWo2 A 
32, W LWwo2 A 
33 x LXO1 LX02 A 
33 X  LXx02 A 
34 y  LYOl LY02 A 
34 «= -¥-~—sCLY02 A 
35 z  LZOl LZ02 A 
35 Z LzZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
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Cc Iceland 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SAQ1) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32). 
Character Table 


w Figure 3-17 shows the character table for Iceland. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


LAO! LA02 A 
A LA02 A 


4 LAI7 LAI& A 
A LAI8 A 


LA27 LA28 A 
A 


B 
4 c LCOl LCO2 A 
4 C LCQ2 A 
5 
5 


d LD01 LDO2 A 
D  LD02 A 
6 6) LD63 LD62 A 
6 D LD62 A 
e 


LEO2 A 
LE02 A 


LE18 A 


LFO1 LFO2 A 
9 LF02 A 


10 gZ LGO!1 LGO02 A 
10 G LG02 A 
1] h LH01 LH02 A 
1] H  LH02 A 


LI0O1 LI02 
LI02 A 


LI7 =6LIN8 


i Lill LIi2 
13 f  Lil2 A 


LJOl LJ02 


Seq Char ID Mono Alpha Note 
Num case /Num 


15 k LKO1 LK02 A 
15 K  LKO2 A 
161 LLO! LLO2 A 


16 
17 


L  LLO2 A 
m LM0Ol LM02 A 


17 M LMo02 A 
18 n LNO] LNO2 A 
18 N  LNO2 A 
19 te) LOO! LO0O2 A 


19 O  LO02 A 


2 6 \LOM LOI A 
2 O LOl2 A 


21 p LPO! LPO2 A 
21 P  LP02 A 


22 q  LQ0l LQO2 A 
22 Q LQ A 


23 r LRO1 LRO2 A 
23 R  LR0O2 A 
24 § LSO1 LS0O2 A 
24 S LS02 A 
25 t LTO! LT02 A 
25 T  LT02 

26 


LUO! LU02 
26 U LU02 


26 LU17 LU18 
26 LU18 


a a 


27 + LUIL LUI2 A 
27 U~ LUil2 A 
28 v  LVO01 Lvo2 A 
23 V Lv02 A 
29 +w  LW01l LW02 A 
29 W_ LWwo2 A 
30 x  LX01l Lx02 A 
30 xX  Lx02 A 
31 y  LYOl LYO2 A 
31 YY Ly02 A 


oe 


31 y 


LY17 A 


Figure 3-17 (Part 1 of 2). Character Table for Iceland 
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Seq Char ID Mono Alpha Note 


Num case 


32. y LYIl LYI2 A 
32, -¥Y ~~~LY12 A | 
33. =z LZOl LZ02 A 
33. Z  LZ02 
b 
p ss 


LT63 LT64 
LT64 


z LAS] LAS2 
AL LAS52 


/Num 


A 
A 
A 
A 
A 


Seq Char ID Mono Alpha Note 
Num case /Num 


36 6 LOIT7 
36 «3O LOI8 A 


36 2 LO61 LOQ62 
36 MG LOQ62 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-17 (Part 2 of 2). Character Table for Iceland 
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Italy 


Character Classification Considerations 
The following special characters are considered numeric: 


Superscript 2 and superscript 3 (ND021 and NDO31) 
One-half, one-quarter, and three-quarters (NF01, NF04, and NFO0S5) 
Minus, plus, and plus-or-minus (SA00, SA01, and SA02) 
Less-than, equal, and greater-than (SA03, SA04, and SA0S) 
Pound sign and dollar sign (SC02 and SC03) 

Percent and asterisk (SM02 and SM04) 

Square brackets (left and right) (SM06 and SM08) 

Braces (left and right) (SM11 and SM14) 

Micro symbol (SM17) 

Degree (temperature) symbol (SM19) 

Parentheses (left and right) (SP06 and SP07) 

Comma, period and slash (SP08, SP11 and SP12) 

Space (SP31). 


Sort Sequence Considerations 


The following characters do not change the sort position and must be treated by 

the sort as if they were not present. An example of this implementation is the 

‘“DC IXI Control Word” of SCRIPT/VS, which allows the listing of characters 2) 
to be deleted from index entries (not from the record) before sorting. , 


Hyphen (required or syllable) (SP10 and SP32) 
Apostrophe (SP05) 

Ampersand (SM03) 

Space and required space (SP01 and SP30) 
All (stand alone) diacritics 

Period 

Comma 

Colon 

Semicolon 

Quotes and quotation marks 

Slash and reverse slash 

Parentheses, brackets, and braces. 


Character Table 


Figure 3-18 shows the character table for Italy. The table lists the characters in 
ascending order with the lowest-weighted character listed first. In Italy, some 
characters are regarded as equivalents and are listed in the table with equal 
weights. After a sort, these characters can be intermixed. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


Seq Char ID Mono Alpha Note 
Num case /Num 


a LAO] LA02 A I 14 n LNO1 LNO2 A 
A LA0O2 A 14 N  LNO2 A 
a LAI3 LAl4 A 15 9) LO01 LO02 


NIN) = —_| —_— — 


A 

A LAI4 A 15 OO LOO02 A 

b  LBOl LBO2 A 15 6 LOI3 LOI4 A 

B LBO2 A 15 O LO14 A 
3 c LCO1 LCO2 A 16 p LPO! LPO2 A 
3 C LC02 A 146 PP  LP02 A 
4 d LDO1 LDO2 A 17 q  LQ01 LQ02 A 
4 D LD0O2 A 17. Q_ LQo2 A 
5 e LEO! LEO2 A 18 LROI] LRO2 A 
5 E  LE0O2 A 18 R LRO2 A 
5 é LEll LEI2 A 19 5s LSO1 LSO2 A 
5 E  LE12 A 19 S§  LS02 A 
5 é LEI3 LEl4 A 20 t LTO1 LYT02 A 
5 E  LEI4 A 20 TT CLYT02 A 
6 f LFO1 LFO2 A 21 =u LUOl LU02 A 
6 F  LF02 A 21 U  LU02 A 


21 u LUI3 LUI14 A 
21 U- LUI14 A 


LGO1 LG02 A 
G LG02 A 


~~ 
ge 


8 h LHO!1 LHO2 A 22 Vv LVOl1 LV02 A 
8 H  LH02 A 22 Vv LV02 A 
9 1 LIO1 LI02 A 23 w LWO01 LW02 A 
9 I LI02 A 23 W LW02 A 
9 i LM3 Lil4 A 24 X LX01 LX02 A 
I LIl4 A 24 x  LX02 A 
j IJ01 LJO2 A 25 y LY01 LY02 A 
J LJ02 A 25 Y  LY0O2 A 


LKO!] LKO2 A 
I] K  LKO2 A 


121 LLO1 LLO2 A 
LL02 A 


m LM01 LM02 A 
M LM02 A 


26 
26 


LZ01 LZ02 A 


Z 
Z LZ02 A 


All other characters are sorted as 


shown in Figure 3-1 on page 3-2. 


Figure 3-18. Character Table for Italy 
Notes for Figure 3-18: 


1. All accented characters and special forms of characters have the same sort weight as the regular 
nonaccented character. For example, 4 is sorted as an a, ¢ is sorted as c, fi is sorted as n, and so on. 
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Japan 


Character Set Considerations 
Japan uses the following character sets: 


e The Japanese EBCDIC Latin - A single-byte character set that contains the 
same set of Latin characters as in the USA EBCDIC. 


e The EBCDIC Katakana - A single-byte character set that contains uppercase 
Latin characters and Katakana characters. Lowercase Latin characters are 
excluded. It should be noted that some of the Katakana characters are 
assigned with the same code points that are normally assigned to the 
lowercase Latin characters in the USA EBCDIC. 


¢ The SBCS-PC Katakana - A single-byte character set that contains both 
uppercase and lowercase Latin characters and Katakana characters. 


¢ The DBCS-Host and DBCS-PC - Both are double-byte character sets and 
each consists of the following groups of characters: 


— Kanji characters 
— Non-Kanji characters containing: 
— Latin characters 
— Katakana and Hiragana characters 
— Greek and Russian characters 
— Special symbols. 
— Additional characters to be defined by users. 


SBCS Sort Sequence 


While the Japanese EBCDIC Latin is sorted the same as the USA EBCDIC, the 
EBCDIC Katakana and SBCS-PC are sorted by special algorithms. 


DBCS Sort Sequence 


Different algorithms are used for DBCS sorting, depending on the applications. 
The most commonly used ones are based on the following types of character 
sequence: 


¢ Telephone directory 
e Dictionary 
e Strokes and radicals. 
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Monocasing Considerations 


The Latin characters in the Japanese EBCDIC Latin and the SBCS-PC 
Katakana are monocased in the same manners as the USA EBCDIC and the 
US-PC code, respectively. 


Monocasing is not applicable to the EBCDIC Katakana because there are no 
lowercase Latin characters defined in the set. 


Monocasing for either the DBCS-Host or the DBCS-PC is usually not 
required unless specifically requested. 
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The Netherlands 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SAO1) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e In family names, on the first pass: 
— d’, de, de la, der, des, and du 
— het and ’t 2 
— in de, in den, in der, in het, and in 't | 
— 1’, la, and le 
— uit de, uit den, uit het, and uit ’t 


— van, van de, van den, van der, van het, van ’t, von, von der, and von 
dem. 


Include them on the second pass to provide a final sorting, as follows: 
— Bogaard 
— inden Bogaard 
— uit de Bogaard 
— van den Bogaard. 
e In city names: ’t, ’s, de, and den may be ignored. 


Character Table 


Figure 3-19 shows the character table for the Netherlands. The table lists the 
characters in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


I a LAOL LA02 A 


A LA0Q2 A 
2 b = LBOI] LBO2 A 
2 B LBO02 A 


LCO1 LC02 A 


Ce Geo 
Qe 


LC02 A 
4d  LDO1 LDO2 A 
4  D_ LDp02 A 
5 e LEO! LEO2 A 
5 E  LE02 A 
6 f  LFOL LFO2 A 
6 FF LFO02 A 
7g  LGO! LGO2 A 
7  G~ LG02 A 
8 h  LHOl LHO2 A 
8 H  LH02 A 
9 i Ll 02 A 2 
9 I LI02 A 
i0 «=j ©6©LJ0l =COLJ02 OA 
i0 J Lyjo2 A 
11 k  LKOl LK02 A 
11 K  LKo2 A 
12 1 LLOL LLO2 A 
i2 L  LLo A 
13 m LMOl LM02 A 
13 M_ LMo2 A 
14. n  LNOl LNO2 A 
14 N_ LNO2 A 


Figure 3-19. Character Table for the Netherlands 


Notes on Figure 3-19: 


1. All accented characters and special forms of characters have the same sort weight as the regular 


Seq Char ID Mono Alpha Note 
Num case /Num 


15 Oo LOO! LO02 A 
15 QO LO02 A 


16 p  LPOl LPO2 A 
16 P  LPO02 A 


17 LQ01 LQ02 A 
LQ02 A 


r LRO! LRO2 A 
I LRO2 


8 R A 
19 S LSO] LSO2 A 
19 S LS02 A 
20 t LTO1 LT02 A 
T LTd2 A 


u LU01 LU02 A 
U- LU02 A 


Vv LVO1 LV02 A 
Vv LV02 A 


w LWO!I LW02 A 
Ww LW02 A 


X LX01 LX02 A 
X LxX02 A 


y LYOl LY02 A 
Y  Ly02 A 


Z LZ01 LZ02 A 
Z LZ02 A 


All other characters are sorted as 


shown in Figure 3-1 on page 3-2. 


nonaccented character. For example, 4 is sorted as an a, ¢ is sorted as c, fi is sorted as n, and so on. 


2. Ina dictionary sort sequence, the 1j ligature is sorted as an i followed by aj. In a telephone directory 


sort sequence, the ij ligature is sorted as a y. 
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New Zealand 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


@ Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “don’t”) (SPOS). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


¢ Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


J 
Character Table 


Figure 3-20 shows the character table for New Zealand. The table lists the 
characters in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 

l a LAOL LAO2 A 

l A LAO2 A 

2 b LBO!1 LBO2 A 

2 B  LBO2 A 


3c  LCOL LCO2 A 
3 Cc LCco2 A 
4d  LDOl LDO2 A 
4  D- LDp0 A 
5 e LEO LEO2 A 
5 EB LEO2 A 
6 f£  LFOl LFO2 A 
6 F  LF0 A 
7 g  LGOl LG0O2 A 
7 G_ LG0 A 
8 h  LHOI LHO2 A 
8 HH LHO02 A 
9 i  LIOL LI02 A 
9 I Lyo2 A 
i0 j Isol Lyo2 A 
10 J Lyo2 A 
11k  LKOl LKo2 A 
11 K  LKO2 A 
12 1 4\LLOL LLO2 A 
12 L Lo A 
13 m LMOL LM02 A 
13. M_ LMo02 A 
14 nn  LNOI LNO2 A 
14. N_ LNO2 A 


Figure 3-20. Character Table for New Zealand 


Seq Char ID Mono Alpha Note 
Num case /Num 


15 O LOO! LOO2 A 
15 O LOO02 A 


16 p  LPOl LPO2 A 
16 P  LP02 A 


17 q LQ01 LQ02 A 
17 Q = =LQ02 A 


18 r LROL LRO2 A 
1 LRO2 A 


21 u LU01 LUO02 A 
21 U LU02 A 


22 v’ LV01 LV0O2 A 
22 Vv =LV02 A 


23 w  LWO01 LW02 A 
23 Ww LWo02 A 


24 X LX01 LX02 A 
24 xX  LX02 A 


25 y LYO!1 LY02 A 
25 Y  LY02 A 


26 2 LZ01 LZ02 A 
26 £  LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
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8 R | 
19 s S01 LSO2 A | 
19 §  LS02 A 
20 t  LTOl LT02 A | 
20 TT  LT02 A | 
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Norway 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
@ Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


® Period (SP11) 
¢ Comma (SP08) 
e Plus (SA01) 

® Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


¢ Hyphen (required and syllable) (SP10 and SP32). 


Character Table ) 


Figure 3-21 shows the character table for Norway. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 

] a LAO] LA0O2 A 

l A LA02 A 

2 b LBO!1 LBO2 A 

2 B  LBO2 A 


3c  LCOl LCO2 A 
3 Cc Lco2 A 
4d  LDOl LDO2 A 
4  D_ Lp02 A 
5 e LEO! LEO2 A 
5 E  LE02 A 
6 f  LFOl LFO2 A 
6 FF LFO02 A 
7 g  LGOl LGO2 A 
7  G_ LG0 A 
8 h  LHOl LHO2 A 
8 oH LH02 A 
9 i LI0l LI0O2 A 
9 I Lj02 A 
i0 j Wor Lyo2 A 
i0 J Lyo2 A 
11 k  LKOl LKO2 A 
11 K  LK0o2 A 
12 1 LILO! LLO2 A 
12 L  LLo2 A 
113. m  LMOl LM02 A 
13. M_ LM02 A 
14 n  LNOl LNO2 A 
14. N_ LNO2 A 
15 o LOO! LOO2 A 
15 OO LO02 A 
16 p  LPOL LPO2 A 
16 P  LP02 A 
17 q  LQO1 LQ0O2 A 


17 A 


Q = LQ02 


Figure 3-21. Character Table for Norway 


Seq 
Num 


18 
18 


19 
] 


2 
21 


22 
22 


23 
23 


24 
24 


25 
25 


26 
26 


27 
27 


28 
28 


29 
29 


Char ID Mono Alpha Note 


case /Num 


r LRO1 LRO2 A 
R  LRO2 A 


S LSO!1 LS0O2 A 
LS02 A 


9 S 
20 t LYTO1 LT02 A 
20 T  LT02 A 
] 


u LU01 LU02 A 
U LU02 A 


Vv LV01 LV02 A 
Vv LV02 A 


w LWO01 LW02 A 
WwW LW02 A 


xX LX01 LX02 A 
Xx  LX02 A 


y LYOl LY02 A 
Y  LY02 A 


LZ01 LZ02 A 


Z 
Z LZO02 A 


ez LA51 LA52 A 
AE LAS52 A 


@ LO61 LO62 A 
O LO62 A 


a LA27 LA28 A 
A LA28 


0 


Z| > 


Chapter 3. Technical Characteristics 


3-55 


People’s Republic of China 
Character Set Considerations 
The People’s Republic of China (PRC) uses two character sets: 
e A single-byte character set (SBCS) for Latin characters 
e A double-byte character set (DBCS) that contains: 
— A level 1 Chinese character set 
— A level 2 Chinese character set 
— A non-Chinese character set containing: 
— Latin, Greek, and Russian alphabets 
— Arabic, Chinese, and Roman numerals 
— Japanese Hiragana and Katakana 
— Chinese phonetic symbols 
— Special characters and symbols. 
SBCS Sort Sequence 
Latin characters are sorted like USA English. 
DBCS Sort Sequence 
The DBCS sort sequence is done by using different sorting algorithms such as: 
e Phonetic sequence 
e Radicals within strokes 
e Strokes within radicals. 
Monocasing Considerations 


e SBCS Latin characters are monocased the same as USA English, 


¢ DBCS does not have uppercase and lowercases. 
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Poland 
Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


¢ Hyphen (required and syllable) (SP10 and SP32). 
Character Table 


Figure 3-22 shows the character table for Poland. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


a LAO! LA02 A 
A LAQ2 A 


a LA43 LA44 A 
LA44 


Seq. Char ID Mono Alpha Note | 
Num case /Num 


19 mn  LNI1 LNI2 A 
19 N  LN12 A 


LOO! LO02 A 
A 


pnt pets 


Nw 
o> 
> 


3b  LBOl LBO2 A 6  LOll LOI2 A 
3  B- LBo2 A 21 O LOl2 A 
4 c¢  LCOl LCO2 A 22 p LPO! LPO2 A 
4 C_ LC02 A 22 P  LP02 A 
5 € LCll LCl2 A 23 q  LQ01 LQ02 A 
5 ¢€ Lcel2 A 23 Q_ LaQ0 A 
6 d  LDOl LDO2 A 24 +r  LROl LRO2 A 
6 D- Lp02 A 24 R_ LRO2 A 
7 e LEO! LEO2 A 25 LSO1 LS02 A 


E  LEO2 A LS02 A 


: 


8 e LE43 LE44 A LS11 LS12 A 
8 E  LE4 A 26 LS12 A 
9 f LFOl LFO2 A 277 «t LTO1 LT02 A 
9 F  LFO02 A 27 TT ~ LTO A 
10 g  LGOl LGO2 A 28 u LUOL LU0O2 A 
10 G  LG02 A 22 U- LuU02 A 
11 h  LHO1 LHO2 A 29 vv LVO1l LV02 A 
11 H~ LHO02 A 29 VV Lv02 A 
12 i L101 LiIo2 A 30 ow LWO01 LW02 A 
12 I Lio A 30 W- Lwo2 A 
300 j LJOl LJO2 A 31 «x  LX0l LX02 A 
133. J LJo2 A 31 xX  Lxo2 A 
14 bk  LKOl LKO2 A 32 y  LYOl LY02 A 
14 K  LK0o2 A 32 Y  LY02 A 
5 1 LLO1 LLO2 A 33 +z  LZOl LZ02 A 
15 L LL A 33 Z  LZO02 A 
16 6 t LL61 LL62 A 34 Zz LZIl LZ12 A 
16 bk LL62 A 34 ZZ) COLZ12 A 
17m LMOl LM0O2 A 35 2 LZ29 LZ30 A 
17 M-~ LM02 A 35 Z  LZ30 A 
118 nm LNOl LNO2 A 


All other characters are sorted as 
N  LNO2 shown in Figure 3-1 on page 3-2. 


A 


Figure 3-22. Character Table for Poland 
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Portugal 
Character Classification Considerations 
The following special characters are classified as alphabetic: 
e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 
e Apostrophe (if 1t has a letter ahead of and behind it) (SPOS). 
The following special characters are considered numeric: 
Period (SP11) 
Comma (SP08) 


Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32). 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the start of a sort field and if they are 
immediately followed by a space: 


e d’, D’, da Da, das, Das, de, De, do, Do, dos, and Dos 
@ oe. 


Character Table 
Figure 3-23 shows the character table for Portugal. The table lists the characters 
in ascending order with the lowest-weighted character listed first. In Portugal, 


some characters are regarded as equivalents and are listed in the table with equal 
weights. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


Seq Char ID Mono Alpha Note 
Num case /Num 


10 j Lol Lyo2 A 
10 J Lyo2 A 
k 


1] LKO] LK0O2 A 


ll K  LK02 A 
12 1 LLO1 LLO2 A 
12 L  LLO02 A 
13) m  LMO0Ol LM02 A 
13 M LM02 A 
14 n  LNO1 LNO2 A 


14 N  LNO2 


> 


A 

A 

15 te) LO1ll1 LOI2 A 

15 O LOI2 A 

c LCOl LC0O2 A 15 te) LO13 LOl4 A 

C LC02 A 15 O LO14 A 

¢ LC41 LC42 A 15 6 LO15 LOI6 A 

¢ LC42 A 15 QO LOI6 A 

d LDO!1 LDO2 A 15 6  LOI9 LO20 A 

D  LD02 A 15 O LOQ20 A 

5 LEO! LEO2 A IS o SM20 A 
5 E  LE02 A 

5 é LE11 LE12 A 16 p  LPOl LPO2 A 

5 EK LE12 A 16 P  LP02 A 

3 e LE13 LE14 A 17 q LQ01 LQ02 A 

5 EK LEI4 A 17 Q = LQ02 A 

5 LEIS LE16 A 18 =o LRO] LRO2 A 

5 LE16 A 18 R  LRO2 A 

6 19 S LSO1 LS0O2 A 

19 S LS02 A 

LGOi LG02 A 20 LTOi LTO2 A 


G LG02 A 


Figure 3-23 (Part 1 of 2). Character Table for Portugal 
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Seq Char ID Mono Alpha Note 
Num case /Num 


22 Vv LVO1 LV02 A 
22 WV LV02 A 


2 


23 w  LWO1 LW02 A 
23 Ww LW02 A 


24 X LX0O1 LX02 A 
24 x  LX02 A 


Figure 3-23 (Part 2 of 2). Character Table for Portugal 


Seq Char ID 
Num 


25 y  LYOl LY02 A 
25 Y  LY02 A 
6 


Zz LZOl LZ02 A 
26 Z 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


LZ02 


Mono Alpha Note 
case /Num 
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Republic of China 


Character Set Considerations 
ROC (Taiwan being the old name) uses two character sets: 
e A single-byte character set (SBCS) for Latin characters 
e A double-byte character set (DBCS) that contains: 
— A primary Chinese character set 
— A secondary Chinese character set 
— A non-Chinese character set containing: 
— Latm, Greek, and Russian alphabets 
— Arabic, Chinese, and Roman numerals 
— Japanese Hiragana and Katakana 
— Chinese phonetic symbols 
— Special characters and symbols. 
SBCS Sort Sequence 
This sort sequence is by prespecified tables. 
DBCS Sort Sequence 


DBCS characters are sorted using different sorting algorithms, depending on the 
following sort sequences: 


e Strokes within radicals 
e Radicals within strokes 
e Phonetic sequence. 

Monocasing Considerations 


e Latin characters on the SBCSs are monocased the same as USA English. 


e Chinese characters do not have uppercase and lowercases. 
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& 


Republic of Korea (South Korea) 
Character Set Considerations 
Korea uses two character sets: 


e A single-byte character set (SBCS) containing 147 characters made up of 
Latin characters and Jamo components of Hangeul characters 


e A double-byte character set (DBCS) that contains: 
— A Hangeul character set 
— A Hanja character set 
— A non-Hangeul and non-Hanja character set containing: 
— Latin characters 
— Jamo components 
— Hiragana, Katakana, Greek, special characters, and Roman numerals. 
SBCS Sort Sequence 
This sort sequence is by prespecified tables. 
( DBCS Sort Sequence 


e Hangeul characters are sorted by binary value, because they are arranged in 
dictionary sequence. 


e Hanja characters are sorted by phonetic sequence. 

e Non-Hangeul and non-Hanja characters are sorted by binary value. 
Monocasing Considerations 

e Latin characters on the SBCSs are monocased the same as USA English, 


e Jamo components, Hangeul characters, and Hanja characters do not have 
uppercase and lowercases. 
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Romania (Rumania) 
Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA0Q0). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


¢ Hyphen (required and syllable) (SP10 and SP32) 
e <Apostrophe (SPOS). 


Character Table 


Figure 3-24 shows the character table for Romania. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 

l a LAO] LA02 A 

l A LA0Q2 A 

2 a LA23 LA24 A 

2 A LA24 A 

3 a LAIS LAI6 A 

3 A LAI6 A 


4b LBOl LBO2 A 
4 B- LBO2 A 
5c LCOL Lco2 A 
5 CC Lc02 A 
6 d  LDpOol LDO2 A 
6 D_ LpD02 A 
7 e LEO! LEO2 A 
7 EB LEO2 A 
8 f  LFOl LFO2 A 
8 oF LFO02 A 
9 g  LGOl LGO2 A 
9 G_ LG02 A 
10 h  LHOl LHO2 A 
i0 H  LH02 A 
11 i Lol Loz A 
11 I Lio2 A 
12 i Lis LN6 A 
i2 i LI6 A 
13 j Lol LJO2 A 
J LJo2 


LKO1 
LK02 


LLO1 LLO2 A 
LL02 A 


m LM01 LM02 A 
M LM02 A 


LKO02 A 
A 


Figure 3-24. Character Table for Romania 


Seq Char ID Mono Alpha Note 
Num case /Num 


17 n LNOI LNO2 A 
17 N  LNO2 A 


18 Oo LOO! LO02 A 
] LO02 A 


8 O 
19 p LPO LPO2 A 
i9 P LPO A 
20 q  LQ0l LQ02 A 
2 Q~~=LON2 A 
LRO1 LRO2 A 
LRO2 A 


S LSO1 LSO2 A 
22 S LS02 A 


yom 


233 +s LS4 LS42 A 
23 § L842 A 
24 +t + #LTOl LTO2 A 
24 T ~ LT02 A 
25 + #LYT41 LT42 A 
25 TT  LT4 A 
26 u LUOI LU02 A 

U _Luo2 A 


V LV01 LV02 A 
27 Vv LV02 A 


28 w LWOL LW02 A 
28 Ww LW02 A 


29 X LX01 LX02 A 
29 XxX  LX02 A 


30 y  LYOl LY02 A 
Y  LyY02 A 


Z LZ01 LZ02 A 
31 Z LZO02 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
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South Africa 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Hyphen (required and syllable) (SP10 and SP32) 
e <Apostrophe (SP05). 


The following special characters are considered numeric: 


Numeric space (SP31, as thousands separator) 
Comma (SP08) 

Period (SP 11) 

Plus (SA01) 

Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
¢ <Apostrophe (SP05). 


Character Table 2 


The table on Figure 3-25 lists the characters in ascending order with the 
lowest-weighted character listed first. In South Africa, some characters are 
regarded as equivalents and are listed in the table with equal weights. After a sort, 
these characters can be intermixed like, for example, the following words: 


Unsorted list Sorted list 
leérmag leer 

léer leér 
leerbock léer 

leér leérafdeling 
leer leerbock 
leérafdeling leérmag 
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Seq Char ID Mono Alpha Note 
Num case /Num 


LAO! LA02 A 
A LA0Q2 A 


Seq Char ID Mono Alpha Note 
Num case /Num 


9 g ~ LGO1 LGO2 A 
9 G_ LG02 A 


It 

I 

I a LAI] LAI2 A 2 10 h LHO1 LH02 A 
I A LAI2 A 10 H  LH02 A 
I a LAI3 LAI4 A 1 LI02 A 
I A LAI4 A 1] I LI02 A 
I a LAI5 LAI6 A Il 1 Lill Li2 A 
It A LAI6 A I] I LI12 A 
jt a LAI7 LAI8& A I] i LIW3 Lil4 A 
I A LAI8 A 1] I LIi4 A 
I a LAI9 LA20 A 1] 1 LIiS LIl6 A 
I A LA20 A 1] | LI16 A 
I a LA27 LA28 A 1] i LIl7 LIl8 A 
I A LA28 A I A 
I z LAS! LAS2 A 1 A 
l fA =LAS52 

2 


LJOl 
LJ02 


LKO1 
LK02 


LJ02 A 
A 


LK02 A 
A 


N 


we We 
Qe |B 


b LBO! LBO2 A 
LBO02 A 


LCO! LC02 A 
LC02 A 


4 ¢  LC4l LC42 A LLO1 LLO2 A 
4 ¢ Lc42 A LL02 A 
5d LDO! LDO2 A m LMO01 LM02 A 
5 D_ LD02 A 15 M_ LMO02 A 
6 68 LD63 LD62 A 16 nm LNOL LNO2 A 
6 BP LD A 16 N_ LNO2 A 
7 A 17. &  LNI9 LN20 A 
7 A 17 N  LN20 A 
7 A A 
7 A A 
7 A A 
7 A A 
7 A A 
7 A A 
7 A A 
7 A A 
8 LFOl LFO02 A A 
8 LF02 A A 


Figure 3-25 (Part | of 2). Character Table for South Africa 
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Seq Char ID Mono Alpha Note 
case /Num 


LOI9 LO20 A 
LO20 A 
LO61 LO62 A 
LOQ62 A 


LPO] LP02 A 
LP02 A 


Seq Char ID Mono Alpha Note 
Num case /Num 


LQO!I LQ02 A Vv LV01 LV02 A 
LQ02 A 27 V LV02 A 
LRO! LRO2 A 28 w LWO1 LW02 A 
LRO02 A 28 W LW02 A 
LSO1 LSO2 A 29 X LX01 LX02 A 
LS02 A 29 x  LX02 A 


23 iB LS61 LS6l A 


24 ft LT01 LY02 A 
24 T  LT02 A 


LT63 LYT64 A 
LT64 A 


LY0O1 LY02 A 


30 Y  LY02 A 
30 y LY11 LY12 A 


Y  LY12 A 
LY17 LY02 A 


3] Z LZ01 LZ02 A 
3 LZ02 A 


] Z 
All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-25 (Part 2 of 2). Character Table for South Africa 


LU01 LU02 A 
LU02 A 


LUI1 LUI2 A 
LU12 A 


Notes for Figure 3-25: 


1. Small and capital variations of the same letter, with or without diacritics, have the same sort 
weight. They can be sequenced according to the Character Identifier. 


2. For printers supporting character subsets because of technical or printing speed limitations, 
all the accented vowels are monocased to their corresponding nonaccented capital vowel. 
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Spain 
Character Classification Considerations 
The following special characters are classified as alphabetic: 


Middle dot (SD63) 

Space (required and numeric) (SP30 and SP31) 

Hyphen (required and syllable) (SP10 and SP32) 

Apostrophe (if it has a letter ahead of and behind it, as in “s’havia”) (SP05) 
All numbers (0-9). 


The following special characters are considered numeric: 


Superscript 2 and superscript 3 (ND021 and NDO31) 
One-half, one-quarter, and three-quarters (NF0O1, NF04, and NF05) 
Minus, plus, and plus-or-minus (SA00, SA01, and SA02) 
Less-than, equal, and greater-than (SA03, SA04, and SA0S5) 
Pound sign and dollar sign (SC02 and SC03) 

Percent and asterisk (SM02 and SM04) 

Square brackets (left and right) (SM06 and SM08) 

Braces (left and nght) (SM11 and SM14) 

Micro symbol (SM17) 

Degree (temperature) symbol (SM19) 

Parentheses (left and right) (SP06 and SP07) 

Comma, period and slash (SP08, SP11 and SP12) 

Space (SP31). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Middle dot (SD63) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the start of a sort field and if they are 
immediately followed by a space: 


e d’, D’, de, De, del, and Del 


e el and El 
e 1’, L’, la, and La. 
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Middle Dot Location in the Character Matrix J 


e Honzontal position: at the center of the box 
¢ Vertical position: at the height of the hyphen. 


In proportional spacing printers, the width of the middle dot should be the same 
as the width of the small i. The size should be the size of the dot over the small 
i. If, due to technical limitations, this size is not feasible, then the size of the 
normal period should be provided. 


Character Table 


The table on Figure 3-26 lists the characters in ascending order with the 
lowest-weighted character listed first. In Spain, some characters are regarded as 
equivalents and are listed in the table with equal weights. After a sort, these 
characters can be intermixed like, for example, the following words: 


Unsorted list Sorted list 

candor candido 

casa candor 

Cantera cantaro 

Castro Cantera 

candido casa 

cantaro cascara 2 
cascara Castro 
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Seq Char ID Mono Alpha Note 
Num case /Num 


RQ eS ee oe 
rr PrPrPrPrrrrryp PrP ry 


b  LBOL LBO2 A 
B  LBO02 A 
c LCOl LCO2 A 
C  LCo02 A 
¢ LC41 LC4 A 
GC LC4 


3 


& ff ] Go G | tO 


Aan 
298 
>>> > 


d LDO! LD02 

D  LD02 

) LD63 LD62 
| D LD62 


ON ON 


> >| > > 


Oo 00 06 CO CO COCO CO CWO COO] I ~) 


Ne) 
Pri rr Prrrrrr ry 


ery > | erg: Erp EO ErHOD~ ET] 


LFO1 LF02 
LF02 


\O 


Seq Char ID Mono Alpha Note 


Num case /Num 
10 g LG0O! LGO2 A 
10 G LG02 A 


1] h LHO!1 LHO2 A 
1] H LH02 


> 


12 i LiOl Lio2 A 
12 I LI02 A 
12 i Lill Li2 A 
12 If Li A 
12 i LiM3 Lil4 A 
12 I Li A 
12 i LS Li6 A 
12 f Li6 A 
12 i LIl7 LU8 A 
12 ff  LII8 A 


2 
13 j LJOl LJo2 A 
13 J Lyjo2 A 
14 k  LKOL LKO2 A 
14 K  LKO2 A 


15 I LLO! LLO2 A 
15 LL02 


17 m LM0l LM02 A 


17 M_ LMo2 A 
18 nn  LNOI LNO2 A 
18 N  LNO2 A 
19 &  LNI9 LN20 A 
i9 N_ LN20 A 


>> D> DD D> D> 


0:9? Om O-O- CriO-O © 


Figure 3-26 (Part 1 of 2). Character Table for Spain 
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Seq Char ID 
Num case /Num 


6 
20 O- L020 A 
%) 


Pp 
21 P LP02 A 


22 q LQ01 LQ02 A 29 Vv LVO1 LV02 A 
22 Q = LQO2 A 29 V LV02 A 
23 r LROL LRO2 A 30 w  LWO!L LW02 A 
23 R  LRO2 A 30 Ww LW02 A 
4 S LSO1 LSO2 A 31 x LX01 LX02 A 
4 S LS02 A 3] X  LxX02 
5 B LS61 LS61 A y LYOl1 
26 t LTO1 LT02 A 32 Y  LyY02 
26 T  LYT02 A 32 y LY11 LY12 


27 
27 


Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num 


LO19 LO20 A 


LO61 LO62 A 
LO62 


LPO1 


A 
LPO2 A 


Y LYl2 
¥  LYI7 


33 z  LZOl LZ02 A 
133 «Z~SCOLLZ02 A 


| All other characters are sorted as 
| shown in Figure 3-1 on page 3-2. 


LT63 LIT64 A 
LT64 A 


4 1 


c 


Figure 3-26 (Part 2 of 2). Character Table for Spain 


Notes for Figure 3-26: 


l. 


2. 


Spain uses character subsets for printers with technical or printing speed limitations. 


Small and capital variations of the same letter, with or without diacritics, have the same sort 
weight. They can be sequence according to the Character Identifier. 


The combinations ch, Ch, and CH are printed, displayed, and otherwise processed as two characters. 
During sorting, each combination is processed as one character that follows all other characters. 
For example, the name Cuenca 1s sorted ahead of Chinchon. 


The combinations ll, Ll, and LL are printed, displayed, and otherwise processed as two characters. 
During sorting, each combination is processed as one character that follows all other | characters. 
For example, the name Ldpez is sorted ahead of Liorente. 


The combination I! and LL are printed, displayed, and otherwise processed as three characters. 
During sorting, each combination is processed as one character, following all other | characters, 
but equivalent to the ll, Ll, and LL combinations. For example, the following four names are 
sorted: Columpio, Collar, Collecci6, and Collerén. 
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C Sweden 


Character Classification Considerations 
The following characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (when used within words) (SP05). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00) 
Asterisk (SM04). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
4 | e Apostrophe (SP05). 


Character Table 


Figure 3-27 shows the character table for Sweden. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note Seq Char ID Mono Alpha Note 


Num case /Num Num case /Num 
are sorted ahead of the numbers. 21 L LLO02 A 

0 0 ND10 N 22 m LM01 LM02 A 

22 M__LM02 A 

23. on  LNOL LNO2 A 


N 
2 N 
3 3  NDO3 N 23 N_ LNO2 A 
4 4 ND04 N 24 +o LOO! LOO2 A 
5 5  NDOS N 24 O LO A 
6 «6 N 25 p LPO! LPO2 A 
N 25 P  LPp02 A 
N 2 q  LQO!l LQ02 A 
N 2 Q  LQ02 A 
LAO! LAQ2 A 27 +r  LROl LRO2 A 
10 A LAO2 A 27 +R LR02 A 
a  LAI3 LAI4 A 28s  LSOL LS02 A 
A LAI4 A 28 S_ LS02 A 
11 b  LBOl LBO2 A 29 +t +~ +\LTOl LTO2 A 
11 B LBO2 A 29 T~ LT02 A 
12 c  LCOl LCO2 A 30 u LUO! LUO2 A 
12 C  Lco02 A 30 U  Luo2 A 
13 +d LDOl LDO2 A 31 vLVOl LV02 A 
133 D  LDbD02 A 31 V Lv02 A 
e LEO! LEO2 A 32. w  LWOL LWo2 A 
14 E LEO A 32 W  LWo2 A 
14 € LEI LEI2 A 33. x LX0l LX02 A 
14 EB LBI2 A xX  LX02 A 
15 f  LFOL LFO2 A y LY0l LYy02 A 
15 F  LF0 A 34 =Y  Ly02 A 
16 g  LGO! LGO2 A i = LUI7 LUI8 A 
A 


16 G  LG02 A U —_LUI8 
17h  LHOl LH02 A 35 z  LZOl LZ02 A 
17 H  LHO02 A 35 Z  Lz02 A 
18 i LI0l Lio2 A 36 A  LA27 LA2® A 
36 A LA28 A 
LJO1 LJo2 37. & LAI7 LAI8& A 
i9 J Lyjo2 A 37 A LAI8 A 


20 k LKO1 LKO2 A 
K LKO2 A 


6  LOI7 LOI8 A 
O  LOI8 A 


Figure 3-27. Character Table for Sweden 
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C Switzerland 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Required space (SP30) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “l’amour’’) (SP05). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Monocasing Considerations 


While the letters A, O, and U are contained in the Swiss character set, they are 
not contained in the Swiss keyboard as a single key. They can be created with 
the aid of the nonescaping (dead) key containing the diaeresis. However, it is 
common in Switzerland to add the letter e after the letters A, O, or U to 
represent A, O, and U, as follows: 


+d e Aerger represents Arger (Anger) 
® Oe6el represents Ol (Oil) 
e Uebersicht represents Ubersicht (Overview). 


The above Swiss words monocase to AERGER, OEL, and UEBERSICHT, but 
the lowercase letters 4, 6, and ti monocase to A, O, and U, respectively. 
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Sort Sequence Considerations 2 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


® Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the beginning of a sort field: 


e d’, and D’ 

e 1’, and L’. 

The following character strings should be given a sort weight between spaces and 
alphabetic characters, if they are at the beginning of a sort field and if they are 


immediately followed by a space: 


e de, De, des, Des, and du 
e la, La, le, Le, les, and Les. 


Character Table 


Figure 3-28 shows the character table for Switzerland, which is based on the 
Multilingual Code Page (MLP). Only a subset of the MLP is actually in use. ) 
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Seq Char ID 


LAI] 
A LAI2 


A LAI4 
A LAI6 


WWINN NN NNNNNNNN NW LY 


LCOl 
LC02 


Qelaos 


: 


LGO1 
LG02 


ga 


1] G 


Figure 3-28 (Part 1 of 2). 


Mono Alpha Note 


LA12 A 


4  LAI3 LAl4 A 
4 LAIS LAI6 A 
4 LAI7 LAI& A 3 


LCQ2 A 


10 LFO! LFO2 A 
10 LF02 A 
11 


LG02 A 


Seq 


Num case /Num Num 
l a LAO] LAO2 A ] 12 
l A LAQ2 A 12 


13 


A 13 


A 
14 
14 
14 


A 


LA18 A 3 14 
4 LAI9 LA20 A 14 
A LA20 A 1 
A LA27 LA28 A 
A LA28 A 
e LASI LA5S2 A 
A 16 
LBOl LBO2 A 17 
LBO2 A 17 


18 


A 18 


5 c LC4l LC42 A 19 
5 C Lce4 A 9 
6 d LDO1 LDO2 A 2 
6 D  LD02_ A 2 
7 5 LD63 LD6é2 A 2 
7 PD LD62 A 
8 e LEO] LEO2 A 22 
8 E  LE02 A 
9 é  LEll LE12 A 
9 E  LEl2 A 
9 é LEI3 LEl4 A 
9 E LEI4 A 
9 é LEIS LEI6 A 
9 3 =~ SC LEI6 A 
9 é LEI7 LEI8& A 
9 ii A 

f 

F 


A 


Character Table for Switzerland 


Char ID 


h 
H 


1 
I 


K 


I 
L 


m 


Mono Alpha Note 
case /Num 


LHO!1 LHO2 A 
LH02 A 


LIOl1 LI02 A 
LI02 


> 


a a a a 


a, 4 
15 j  LJol Loz A 

15 J Ljo2 A | 
16 k 


LKO!1 LK02 A 
LKO02 A 


LLO!1 LLO2 A 
LLO2 A 


LMO! LM02 A 


M  LMo02 A 


n 


oO: Oo O-o- Oro 


LNO!] LNO2 A 


LNO2 A 
A 


l N 

0 fh .LNI9 LN20 A 
0 N  LN20 

O LOO! LOO02 A 
21 O LOO02 


LPO1 LP02 


PPL >> FrPrPrr rrr PPr|> 
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Seq Char ID Mono Alpha Note 
Num case /Num 


Seq Char ID Mono Alpha Note 
Num case /Num 


24 q LOO! LQO2 A 31 & LUI7 LUI8 A 3 
2% Q  LQ02 A 31 U  LUI8 A 3 
25 +:  LROl LRO2 A 32. v LV0l LV02 A 
125 R LRO0O A 32 VV Lv02 A 
26s  LSOl LSO2 A 33 w  LWOl LW02 A 


| 26 S LS02 A 33 W LW02 A 


LS61 LS6l1 A 4 34 x LX0l1 LX02 A 

34 xX  LXQ2 A 

LTO1 LT02 A 35 LYO1 LY02 A 

28 LT02 A 35 Y  LY02 A 


29 b LT63 LT6é4 A 36—iCiY LY11 LY12 
29 Bb  LT64 A 36 Y  LY12 
u ~=LUO01 LU02 A 36—s¥ LY17 LY02 


37 Z LZ01 LZ02 A 
37 Z LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
Figure 3-28 (Part 2 of 2). Character Table for Switzerland 
Notes for Figure 3-14: 
1. Small letters precede the capital letters, and letters without diacritics precede letters with diacritics. 
2. Equal letters with different diacritics are sequenced according to the Character Identifier. 
3. When the sort is according to the sound of the words as in telephone directories, address lists, then: 
® 4 is sorted as ae, 6 1s sorted as oe, and ii is sorted as ue. 
For other sorts such as dictionaries, encyclopedias, then: 
® 41s sorted as a, 6 is sorted as 0, and ii is sorted as u. 


4. B (LS61) is sorted as ss. 


3-78 National Language Information and Design Guide Volume 2 


C 


Thailand 


Character Classification Considerations 


The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


Thailand uses a single-byte character set (SBCS) for Latin characters and Thai 


character components (vowels, consonants, numerals, and tone marks). 


The Latin characters contained in the Thai SBCS are sorted the same as USA 


English. 


User sort exits are needed to sort USA English text and to support the Thai 


Telephone Directory and Thai Dictionary sort sequence. 


Character Table 


Figure 3-29 shows the Thai-character table for Thailand. The table lists the 
characters in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


l BK10 A 


Z BK20 A 
3 BK30 A 
4 BK40 A 
5 BK50 A 
6 BK60 A 
a BN10 A 
8 BC10 A 
9 BX10 A 
10 BS 10 A 
11 BX20 A 


12 BX30 


> 


13 BY 10 A 
14 BD10 A 
15 BT 10 A 


16 BT 20 | 


> 


17 BT30 A 
18 BYT40 A 
19 BN20 A 
20 BD20 A 
21 BT50 A 
22 BT60 A 


23 BT70 A 
24 BT80 A 
25 BN30 A 
26 BB10 A 


28 BP20 
29 BF 10 
30 BP30 
3] BF20 


> 


>| >| > 


Seq Char ID Mono Alpha Note 
Num case /Num 


36 BR20 


39 BW10 


40 BS20 
] 


>| > 


>>| > 


>>| > 


44 BL30 
47 BQ20 
50 BA30 
55 BU20 
57 BU40 
58 BQ30 


59 BE20 
60 BE30 


61 BO20 
62 BASO 


>| > 


0 ol ead Ea el cel Coal Cel oa 


>| > 


ed eal el oll oe call 


: 


> 


Figure 3-29 (Part 1 of 2). Thai-Character Table for Thailand 
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Seq Char ID 
Num 


Mono Alpha Note 


Seq Char ID Mono Alpha Note 
Num case /Num 


ND0O10002 
ND020002 
76 ND030002 S 
ND040002 
ND050002 


3 S 


8 SC13 


Figure 3-29 (Part 2 of 2). Thai-Character Table for Thailand 


” Notes on Figure 3-29: 


1. The “Char” column is left blank as Thai characters are not printable on this system at the present time. 
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Turkey 


Character Classification Considerations 

No special characters are classified as alphabetic. 

The following special characters are considered numeric: 
¢ Comma (SP08) (used as a decimal separator) 

e Minus (SA00) 

¢ Plus (SA01). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e All special characters (that is, all characters except those shown in the 
character table). 


Character Table 


Figure 3-30 shows the character table for Turkey. The table lists the characters 

in ascending order with the lowest-weighted character listed first. In Turkey, | 
some characters are regarded as equivalents and are listed in the table with equal ) 
weights. 


3-82 National Language Information and Design Guide Volume 2 


Seq Char ID Mono Alpha Note 


Num case /Num 
l a LAO] LAO2 A 

l A LA0Q2 A 

l a LAIS LAI6 A 

l A 

2 


LBO! LBO2 A 


oe 
ve] 
oS 
S 
> 


Www) 
Qepas 


LCOl LCO2 A 


LC02 A 
4 ¢ LC4l LC42 A 
4 C¢ Lc A 
5 d  LDO1 LDO2 A 
5 D_ LD02 A 
6 e  LEOl LE02 A 
6 E_ LE0 A 


LFO!l LFO2 A 
LF02 A 


“s~w 
9 a 


mj 


8 g LGOl LG02 A 
8 G _LG02 A 
9 £ LG23 LG244 A 
9 G LG24 A 
10 h  LHO01 LHO2 A 
10 H  LHO02 A 


LI61 LI02 A 


i LI01 LI30 
12 I LO A 


LIS) = =LI16 


14 k LKO1 LK02 A 
l LK02 A 
A 


4 K 

15 L  LL02 

16 M_  LMO02 A 

17 =n  LNO1 LNO2 A 

17 N  LNO2 A 


Figure 3-30. Character Table for Turkey 


Seq Char ID Mono Alpha Note 
Num case /Num 


18 o  LOOl LOO2 A 
18 O  LO02 A 
19 6 LOI7 LOI8 A 
i9 © LOI8 A 
20 p LPO! LPO2 A 
20 PP LP02 A 
21 q  LQOl LQ02 A 
21 Q  LQ02 A 
22 +r  LROl LRO2 A 
22 +R  LRO2 A 
23s LSOl LSO02 A 
23 §  Lso2 A 


24 § LS41 LS42 A 
24 5 LS42 A 
25 t LTO1 LT02 A 


LT02 A 
LUO! LU02 A 
LU02 A 


LUI5 LUI16 A 
2 LU16 


6 A 
27 + LUI7 LUI8 A 
27 +U  LUIB A 
28 =v LVOl LV02 A 
28 V  Lv02 A 


29 w  LWO01 LW02 A 
29 W LW02 A 


30 X LX01 LX02 A 
30 X  LxX02 A 


31 y LYO1 LY02 A 
31 Y  LYO2 A 


32 
32 


WN 
ON 
eit =z S 


and so on for other numerals... 
41 8 ND08 N 


42 9 NDO9 N 
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The United Kingdom 


Character Classification Considerations 

The following special characters are classified as alphabetic: 

e Space (required and numeric) (SP30 and SP31) 

e Hyphen (required and syllable) (SP10 and SP32) 

e Apostrophe (if it has a letter ahead of and behind it, as in “don’t’’) (SP05). 


The following special characters are considered numeric: 


e Period (SP11) 
@¢ Comma (SP08) 
e Plus (SA01) 

@ 


Minus (SA00). 
Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


S) 


Character Table 


Figure 3-31 shows the character table for the UK. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 


l a LAO] LA02 A 
A LA02 A 
2 b LBO! LBO2 A 
2 B LBOQ2 A 
3 c LCOl1 LC02 A 
3 C LCO02 A 
4 d LDO1 LDO2 A 
4 D  LD02 A 


LEO1 LE02 A 


aa 
tri @ 


LE02 A 
6  f  LFO! LFO2 A 
6  F  LFO02 A 
7 ~g  LGOL LGO2 A 
7  G_ LG02 A 
8 h  LHOl LHO2 A 
8 H_ LH02 A 
9 i LI0l LI02 A 
9 I Li02 A 


10 j LJOl LJo2 A 
10 J LJ02 A 
11k  LKOL LKO2 A 
11K __LK02 A 
12 1 ILO0l LLO2 A 
12 L_ LL02 A 
13. m  LMOl LM02 A 


M LM02 


n LNOI 
N  LN02 


A 


LNO2 A 
A 


Figure 3-31. Character Table for the UK 


Seq Char ID Mono Alpha Note 
Num case /Num 


15 o LOO! LOO2 A 
15 O  Lo02 A 
i6 p  LPOl LPO2 A 
16 P  LP02 A 
17 q  LQO! LQ02 A 
17 Q  LQo2 A 


r LROI LRO2 A 
R  LRO2 A 


LSO! LSO2 A 
LS02 A 


LTO! LYT02 A 
LT02 A 


LUO] LU02 A 
LU02 A 


LVOl1 LV02 A 
LV02 A 


w LWO0O1 LW02 A 
Ww LW02 A 


X LX01 LX02 A 
24 x  LX02 A 


LYO1 LY02 A 
LY02 A 


Zz LZO1 LZ02 A 
Z  LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Chapter 3. Technical Characteristics 


3-85 


The United States 


Character Classification Considerations 
The following special characters are classified as alphabetic: 


e Space (required and numeric) (SP30 and SP31) 
e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (if it has a letter ahead of and behind it, as in “don’t”) (SP05). 


The following special characters are considered numeric: 


Period (SP11) 
Comma (SP08) 
Plus (SA01) 
Minus (SA00). 


Sort Sequence Considerations 


The following characters should be given a sort weight between spaces and 
alphabetic characters: 


e Hyphen (required and syllable) (SP10 and SP32) 
e Apostrophe (SP05). 


| ) 
Character Table 


Figure 3-32 shows the character table for the USA. The table lists the characters 
in ascending order with the lowest-weighted character listed first. 
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Seq Char ID Mono Alpha Note 
Num case /Num 

l a LAO] LA0O2 A 

l A LAOQ2 A 

2 b LBO1 LBO2 A 

2 B LBO02 A 


3 c LCO!] LC02 A 
3 C LC02 A 
d LDO!1 LDO2 A 
D  LD02 A 


Mm & 


Na 
ola Bes i. 


w— 


LEO1 LE02 A 
LE02 A 


LFO1] LFO2 A 
LF02 A 


7g LGO! LGO2 A 
7  G~ LGO02 A 
8 oh  LHOl LHO2 A 
8 oH LH02 A 
9 i  Li0l Lo2 A 
9 I LI02 A 
10 j LJol Ljo2 A 
10 83 Lyo2 A 
i! k  LKOl LK02 A 
11 K  LKo2 A 
12 1 LLOl LLO2 A 
12 L  LLo A 
13 m  LMOl LM02 A 
13 M  LMo2 A 
14. mn LNO] LNO2 A 


N  LN02 A 


Figure 3-32. Character Table for the USA 


Seq Char ID Mono Alpha Note 
Num case /Num 


15 Oo LOO! LO0O2 A 
15 O  LO02 A 


16 p LPO! LPO2 A 
16 P  LP02 A 


17 LQ01 LQ02 A 
LQ02 A 


r LROl LRO2 A 


1 LR02 A 


8 R 
19 S LSO!1 LS0O2 A 
19 5 LS02 A 
20 t LTO1 LT02 A 
20 T  LT02 A 
21 


u LU01 LU02 A 
21 -U  LU02 A 


22 Vv LV0O1 LV02 A 
22 Vv LV02 A 


23 w LWO01 LW02 A 
23 Ww LW02 A 


24 X LX01 LX02 A 
24 XxX  LX02 A 


25 y LYO1 LY02 A 
25 Y  LY02 A 


26 Z LZ01 LZ02 A 
26 Z LZ02 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Chapter 3. Technical Characteristics 


3-87 


Yugoslavia 
Character Classification Considerations 
Figure 3-33 shows the Latin-character table for Yugoslavia. Figure 3-34 shows the Cyrillic-character table 


for Yugoslavia. The tables list the characters in ascending order with the lowest-weighted character listed 
first. 


Seq Char ID Mono Alpha Note Seq Char ID Mono Alpha Note 
Num case /Num Num case /Num 

l a LAOL LAO2 A 17 n LNOL LNO2 A 

l A LA02 A 17 N  LNO2 A 

2 b LBO! LBO2 A 18 O LOO! LOO02 A 

2 LBO02 A 18 O LOO02 A 


B 
c LCOl LC02 A 
C LC02 A 


¢ LCll LCl2 A 

LCl2 A 

LC21 LC22 A 
LC22 A 


LDO1 LDO2 A 
LD02 A 


LD61 LD62 A 
LD62 A 


e LEO] LE02 A 
LE02 A 


LFO1 LFO02 A 
LF02 A 


LGO1 LG02 A 
LG02 A 


& | We 


19 p  LPOl LPO2 A 

19 =P LP02 A 

0 LQ01 LQ02 A 
LQ02 A 
r LROl LRO2 A 
R  LR02 A 


S LSO1] LS0O2 A 
22 S LSO02 A 


LS21 LS22 A 
LS22 A 


LTO! LT02 A 
24 LT02 A 


25 u LUO! LU02 A 
U LU02 A 


Vv LVOl LV02 A 
26 V LV02 A 


_ 
Oh 


AN 


nN OW 
7 HY | 


~s~ 


8 
8 


es) 


WO Oo 
jy > 


LHO!l LH02 A 27 w LWO01 LW02 A 
LH02 27 Ww LW02 A 
LIO!1 LI02 28 X LX0OL LX02 A 


LI02 


LJO1 LJ02 
LJO2 


LKO1 LKO02 A 
LK02 A 


LLO1 LLO2 A 
LL02 A 


m LMO0l1 LM02 A 
M LMO02 A 


X  LX02 


LYO1 


A 


LY02 A 
LY02 A 


Z LZ01 LZ02 A 
30 Z LZO02 A 


31% LZ21 LZ22 A 
31. Z LZ22 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 


Figure 3-33. lLatin-Character Table for Yugoslavia 
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Seq Char ID Mono Alpha Note 


Num case /Num 
l a KAO] KA02 A 
l A KAO2 A 
2 6 KBO1 KBO2 A 
2 b KBO02 A 


3 B KV01 KV02 A 
3 B  Kv02 A 
KG0O1 KG02 A 
KG02 A 


Mnih & 
oy a 


oa) 


mn KDO1 KDO2 A 
MQ KD02 A 
r KGll KG1l2 A 
KG12 


KD61 KD62 A 


nr 
mh 
> 


KE0O1 KEO2 A 


oO .O |] 0 CO] ~] ~] 

ME [me | oo 
‘Ss 
o, 
ON 
No 

> Prl >>| > 


KE02 
KZ21 KZ22 A 
KZ22 
i0 3  KZOl KZ02 A 
i0 3  KzZzO02 A 
11 s  KZIS KZ16 A 
11 S  KZI6 A 
12 wu KI0Ol KI02 A 
12 WU KiI02 A 
13. j  KJOl K4JO2 


> > 


13 J KJ02 


14 «xk KK0O1l1 KK02 A 
14 K KKQ2 


15 i KLO1 KLO2 A 


> 


15 JI KLOQ2 A 
16 mw KI41l KL42 A 
16 Jb KIA2 A 
17 m KMOl KM02 A 


M KMO02 A 


Figure 3-34. Cyrillic-Character Table for Yugoslavia 


Seq Char ID Mono Alpha Note 


Num 


18 
18 


19 
19 


20 
20 


21 
21 


22 
22 


5 
25 


26 
26 


27 
27 


28 
28 


29 
29 


30 
30 
31 
31 


32 
32 


33 
33 


case /Num 


H KNO1 KNO2 
H KNOQ2 


® KNI1 KNI2 
Ih KNI12 


O KO01 KO02 
QO KO02 


mM KP01 KP02 
II KP02 


p KROL KRO2 A 
P  KRO02 


3 
24 KTO1 KT0O2 
24 KT02 
2 


A 

23 Cc KSO1 KS02 A 
2 C KS02 A 
T A 

T A 

A 


kK  KK1i1 KK12 
kK  KK12 


h KCl1 KC12 
hh KC12 


y KU01 KU02 
Y  KU02 


mb KFO] KF02 
Db KF02 


X KHO0Ol KHO2 A 
xX  KHO02 


Lf KC0Ol KC02 
IY KC02 


4 KC21 KC22 
4 KC22 


u KG21 KG22 
Il KG22 


m KS21 KS22 
IW KS822 A 


All other characters are sorted as 
shown in Figure 3-1 on page 3-2. 
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Keyboards 


This section provides information on types of locks used (caps lock or shift lock) 
and on types of numeric keypads. 


Shift Lock 
Shift lock is the equivalent to keeping the shift key down to access characters 
engraved on the upper-shift position of the keys. Other shifts such as the 
Alternate or Control are not affected by Shift lock. 

Caps Lock 


Caps lock converts lowercase alphabetic characters to their monocase (uppercase) 
character. It 1s equivalent to the monocase rules given for the various character 
sets, and it is performed regardless of the shift position of the lowercase alpha 
character. For example, if caps lock is activated and é being in the upper-shift 
location of the key is pressed along with the shift key, then the character E or E 
is generated, depending on the monocase rules of the particular character set. 


Numeric Pad Formats 


These diagrams show the numeric keypads used in Figure 3-35 and Figure 3-36. 2 


Format 1 Format 2 Format 3 
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Country Numeric Type of Alternative 
Pad lock Type of 
Definition Preferred Lock 


Czechoslovakia 


Germany (GDR) 


Format 1 
Fonpat 1 
Format 


Format 2 
Format | 


Fonnat 
Thailand Format 2 


Format 2 


Figure 3-35 (Part 1 of 2). Keyboard Information Table |: Left-to-Right Languages 
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Country Numeric Type of Alternative 
Pad lock Type of 
Definition Preferred Lock 

shift pe 


Figure 3-35 (Part 2 of 2). Keyboard Information Table 1: Left-to-Right Languages 


Notes on Figure 3-35: 
1. The country wants to be able to switch between caps lock and shift lock. 


2. Both caps lock and shift lock are acceptable. 


Country Numeric Type of Alternative 
Pad lock Type of 
Definition Preferred Lock 
Japan| Format2 | | 
[Korea | Format2_ | 
PROT ica, | shi 
ROC (Taiwan) 


Figure 3-36. Keyboard Information Table 2: DBCS Languages 


@ 
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Currency 


The following section is about money and how it is represented in various 
countries. In particular, this section describes: 


e The symbol used to represent the currency 

e The currency format used 

e The representation of negative values 

e The padding character used 

¢ The characters used for thousands separator and decimal separator. 

The following tables (Figure 3-37 and Figure 3-38) show the currency symbols 
used in each country, the format used in each country to state the value of an 
item of money, the format used to indicate a negative amount of money, and the 


International Currency Sign. 


Standards Note: The International Currency Sign is taken from the [SO 42/7 
Codes for the representation of currency and funds. 
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Country Currency Positive Negative International 
Name Format Format Format 

Lek123,45 | -Lek123,45__ | ALL 123,45 

Al23,45 | -A123,45___ | ARP. 123,45 


12345 [-$ 12349 | ATS 123.4 
Belgium Belgian Franc 123,45 BF -123,45 BF BEF 123,45 
(Note 1) (Note 2) 
Bulgaria ‘Lew ss | Lw123,45 | -Lw123,45 | BGL 123,45 
Canada (Note 3) | Canadian $123.45 -$123.45 CAD 123.45 
Dollar 123,45 $ (123,45 $) 
Czechoslovakia Kés 123,45 | -Kés 123,45 | CSK 123,45 
Denmark kr 123,45 | kr -123,45 | DKK 123,4 


be 
< 


Finland Markka FIM 123,45 
Mk -123,45 

France FRF_ 123,45 

Germany (GDR DDM _123,4 

Germany (FRG) | Deutsche Mark DEM 123,4 


Greece Drachma Drs123,45 -Drs123,45 GRD 123,4 


Hungary FT123,45 | -FT123,45 | HUF 123,45 


Iceland Icelandic 123,45 kr - 123,45 kr ISK 123,45 
Krona 


L. 123 “L. 123 ITL. 123 
Netherlands F 123,45 | F -123,45 NLG 123,45 


New Zealand New Zealand $123.45 -$123.45 NZD 123.45 
Dollar 


~—” 


[South Aftica [Rand R12345___[(RI2345)__[ ZAR 123.45 
(Note 4) -123 Pts 
123TL -123TL 

£12345 


Figure 3-37 (Part 1 of 2), Currency Format Table 1: Left-to-Right Languages 
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Country Currency Positive Negative International 
Name Format Format Format 

USA American $123.45 -$123.45 AMD 123.45 
Dollar > 


Din 123,45 | -Din 123,45 | YUD 123,45 


Figure 3-37 (Part 2 of 2). Currency Format Table 1: Left-to-Right Languages 


Notes on Figure 3-37: 
1. “123,45 FB” and “123,45 Fr” are also acceptable. 


2. BEC (convertible franc) and BEL (financial franc) are also used as currency 
symbols. 


3. Canadian English documents use the first format, and Canadian French 
documents use the second format. 


4. The singular is Pta, but the plural can be Ptas or Pts. Both “Pts 123” and 
“123 Pts” are acceptable. 


¥123 -¥123 JPY 123 


Korea Won W 123.45 -W 123.45 KRW 123.45 
(Note 1) 
PRC Yuan ¥123 -¥ 123 CNY 123 
Ren Min Bi 


ROC (Taiwan) NT Dollar $123.45 -$123.45 TWD 123.45 


Figure 3-38. Currency Format Table 2: DBCS Languages 


Country Currency Positive Negative International 
Name Format Format Format 


Notes on Figure 3-38: 


1. (W123.45) is also used. 
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The tables on Figure 3-39 and Figure 3-40 show the characters used as .) 
thousands and decimal separators, the number of decimal positions, the length of 

the currency field used in the country (including the decimal positions, if 

applicable), and the symbols used in each country to pad out the format (eg. 

$***0.45), 


Standards Note: The International Conference on Weights and Measurements 
has recommended that neither the Comma nor the Fullstop be used as the triad 
(thousand) separator but that the Space be used instead. 


Country Thousand | Decimal Currency | Currency | Padding 
Separator | Separator | Decimal | Field Character 
Peierls aaa 


Albania 


| A IR POR a SS 
a a a ee ee ee 
Es 


Bulgaria | 


oe a ee 
' (Note 3) "(Note 4) 
[Czechoslovakia |. |, | 2 tt 
a 
Finland {space f, | 2 
10 


France space 


pGemany(GDRY|6 = 


i 


ESR SE A a 
a: 


Greece 


ane 
— 
PEPE 


Hungary space 


| 
ielnd it. id is 
iy | none | is 
[Nethedands —[. —<idis 
[New Zealand [, 
[Nowy 
[Poland it. 
a 
Romania _[. 
Ca 
a 


Figure 3-39 (Part 1 of 2). Currency Separator Table 1: Left-to-Right Languages 
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Thousand | Decimal Currency | Currency | Padding 
Separator | Separator | Decimal i 
Position 


kia A PS A 


Figure 3-39 (Part 2 of 2). Currency Separator Table |: Left-to-Right Languages 


Notes on Figure 3-39: 


1. Belgium official documents use a space as a thousands separator, rather than 
the period, and American companies use a comma as a thousands separator. 


2. Belgium official documents also use a comma as a decimal separator, but 
American companies use a period as a decimal separator. 


3. Canadian English documents use a comma as a thousand separator, and 
Canadian French documents use a space as a thousand separator. 


4. Canadian English documents use a period as a decimal separator, and 
Canadian French documents use a comma as a decimal separator. 


Decimal 
Separator 


Thousand 
Separator 


Padding 


Currency | Currency 
i Character 


Decimal 
Position 


Japan |, fone | none | 14 | undefined _ 
Korea | 2 tome 
ROU 2 none 
PROC (Taiwan) |, TE 


Figure 3-40. Currency Separator Table 2; DBCS Languages 
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Number Rounding 


In general, numbers round according to the following (where . represents the 
decimal separator): 


XYZ.0 through XYZ.4 roundto XYZ 
XYZ.5 through XYZ.9 round to XY(Z+1) 


For example: 123.4 roundsto 123 
123.7 roundsto 124 


Numbers with two decimal positions round according to the following (where . 
represents the decimal separator): 


XYZ.ABO through XYZ.AB4 round to XYZ.AB 
XYZ.AB5 through XYZ.AB9 round to XYZ.A(B+t 1) 


For example: 123.454 rounds to 123.45 
123.457 rounds to 123.46 


Argentina rounds two decimal places as above. The third decimal place (second 
in some cases) rounds as follows (where . is the decimal separator although , is 
also accepted): 


XYZ.AB1 and XYZ.AB2 round to XYZ.ABO } 
XYZ.AB3 through XYZ.AB7 roundto XYZ.AB5 
XYZ.AB8 and XYZ.AB9 round to XYZ.A(B+ 1)0 


For example: 123,452 rounds to 123,450 
123,453 rounds to 123,455 
123,458 rounds to 123,460 


Switzerland has different rounding rules for monetary values as shown below 
(where . is the decimal separator): 


XY.(Z-1)76 through XY.Z25 round to XY.Z0 
XY.Z26 through XY.Z75 round to XY.Z5 


For example: 17.376 rounds to 17.40 
23.525 rounds to 23.50 
9.826 rounds to 9.85 
3.475 rounds to 3.45 


For nonmonetary values, the Swiss rounding follows normal conventions. 


3-98 National Language Information and Design Guide Volume 2 


C 


Number Notation 


The following tables (Figure 3-41 and Figure 3-42) show the 
notation used to represent numbers in the various countries. 


Format Format 
12 345,67 | (12 345,67) 
USA 12,345.67 


Figure 3-41] (Part | of 2). Number Notation Table |: Left-to-Right Languages 
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Country Positive Negative 
Format Format 
12.345,67 | -12.345,67 


Figure 3-41 (Part 2 of 2). Number Notation Table 1: Left-to-Right Languages 


Notes on Figure 3-41: 


1. Canadian English documents use the first format, and Canadian 
French documents use the second format. 


As an alternative negative format, Canadian English documents 
use (12,345.67), and Canadian French documents use -12 345,67. 


2. Sweden uses a period as a thousand separator when the amount is 
a monetary amount. 


Figure 3-42. Number Notation Table 2: DBCS Languages 


Country Positive Negative 
Format Format 
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Percent Symbol 


Calendars 


The usual format used to indicate percent is: 
37 % 


South Africa and Switzerland prefer no space between the number and the 
symbol, as follows: 


37% 


The Dutch Language used by Belgium and The Netherlands sometimes uses the 
abbreviation pct., as follows: 


37 pet. 
In Turkey, the percent symbol is written before the number: 


%37 


Most countries use the Gregorian calendar. The following countries use other 
calendars: 


e Japan uses the Gregorian calendar but also uses a Japanese calendar, which 
differs from the Gregorian calendar in that an era name replaces the year. 
The era name indicates the year of the reign of the emperor. For example, 
the era name SHOWA 60 is equivalent to 1985. In the example, SHOWA 
60 means year 60 in the reign of the current emperor, who is named 
SHOWA. 


® ROC also uses the Chinese lunar calendar. 


e Thailand uses the Buddhist Era (B.E.) calendar. 


Chapter 3. Technical Characteristics 3-101 


Date Format 


The following tables (Figure 3-43 and Figure 3-44) show the 
preferred and alternative formats for recording the date. All 
examples use the nineteenth day of the tenth month of 1986. 


Country Date Alternative 
Format Date 
Preferred Format 


1986-10-19 
1986-X-19 


86-10-19 (Note 3) 19-10-86 (Note 3) 


Sweden 


Thailand 19/10/86 (Note 6) dd mmmmm yyyy 
(Note 7) 


19/10/86 86/10/19 


Figure 3-43 (Part | of 2). Date Format Table 1: Left-to-Right Languages 
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Country Date Alternative 
Format Date 
Preferred Format 


19/10/86 
10/19/86 86/10/19 
1986-10-19 86-10-19 


Figure 3-43 (Part 2 of 2). Date Format Table 1: Left-to-Right Languages 


Notes for Figure 3-43: 
1. Belgium Official documents use 19.10.86. 
2. Bulgaria uses Roman numerals for the month number. 


3. For Canadian French documents. 


4. 19/10/1986 and 19/10/86 are also acceptable. The official Portuguese 


standard (that is, 1986-10-19) is like the ISO standard, but it is not 
commonly used. 


5. Leading zeros must not be suppressed (for example 1986-05-08). 


6. Year is replaced by Buddhist Era. 


7. Official documents use this date format. Spaces are used as separators. The 
month is abbreviated to five Thai characters, and the year is replaced by 


Buddhist Era and uses four Thai characters. 
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Country Date Alternative 
Format Date 
Preferred Format 


1986-10-19 
PRC (1986.10.19 | mone 


Figure 3-44. Date Format Table 2: DBCS Languages 


Notes for Figure 3-44: 
1. 86.10.19 is an alternative date format. 


2. 19/10/86 is an alternative date format. 
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Time Format 


Tables Figure 3-45 and Figure 3-46 show the format for recording the time. All 


examples use a time of 10 hours, 42 minutes, 00 seconds; some have an 


additional 30 milliseconds shown. 


Standards Note: According to the International Standards Organization (ISO), 
standards IS 3307 and IS 1000, the decimal digits representing “fractions of a 
second” and the decimal digits representing the ‘‘seconds” should be separated by 
the same character representing the “decimal separator” (a comma or a period) 
for the country. (See section on Number Notation.) The fractions could be 
tenths of a second (one digit), hundredths of a second (two digit) or thousandths 


of a second (three digits). 


Format of 
Preferred Hours 
[France 


France 


Figure 3-45 (Part 1 of 2). Time Format Table 1: Left-to-Right Languages 
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Country Time Range 
Format of 
Preferred Hours 


10h42:00.030 
10:42:00 
KI 10.42.00 


Switzerland 10,42,00 0 - 23 
(Note 1) 


0-23 


Figure 3-45 (Part 2 of 2). Time Format Table |: Left-to-Right Languages 


Notes on Figure 3-45: 


1. While the decimal separator for currency is the period, the comma is used for 
all other fractions; thus for seconds, the comma is used. 


Country Time 
Format 
Preferred 


Figure 3-46. Time Format Table 2: DBCS Languages 
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C Weights and Measurements System 


The following tables (Figure 3-47 and Figure 3-48) 
show the system of measurements used in the countries. 


Country Measurement 
System 


Finland | Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric 

Metric (Note 1 
Metric 

Metric (Note 1) 
USA (Note 2) 


Figure 3-47. Weights and Measurements Table 1: Left-to-Right Languages 


Notes for Figure 3-47: 


1. Imperial is still used in some areas. 


Sp) 
Ss 
fl 
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2. Most units are Imperial, or Imperial-derived USA units. 
In the scientific area, Metric units are used. 


System 


Figure 3-48. Weights and Measurements Table 2: DBCS Languages 
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C Additional Symbols that Vary from Country to Country 


France sometimes uses superscripted letters when abbreviating words. For 
example, the word Compagnie is abbreviated to Cie, but with the letters 1 and e 
written in a smaller font and superscripted. This technique is also practiced in 
English when abbreviating ordinal numbers. For example, the ordinal number, 
second is abbreviated to 2nd, but with the letters n and d written in a smaller font 
and superscripted. In the latter case, the abbreviated form is often seen without 
the tail letters superscripted, when superscripting is not available. 


Argentina, Belgium, Bulgaria, France, Italy, Portugal, and UK use N followed by 
a superscripted lowercase o (SMO0), as a symbol for number, not #. 


The Dutch Language used by Belgium and The Netherlands uses No. or nr. as 
short forms of the word number. 


Norway uses the following abbreviations: 

nr. for number 

kl for time (see note following Figure 3-45 on page 3-105) 
md for month 


USD for USA dollar 


‘ In the Netherlands, + is used to indicate approximately. 
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Appendix. National Language Standards and Laws 


This appendix lists: 

e Some standards organizations and their addresses 

e Some language standards that relate to graphic symbols 

e Some language laws that have been passed by various countries. 

The following lists of standards organizations, standards, and national laws are not an exhaustive 


compilation. The standards organizations and national governments should be consulted for current and 
complete versions of the material. 


Language-Related Standards 


Standards Organizations’ Addresses 


AFNOR Association francaise de normalisation 
Tour Europe, Cedex 7, 92080 PARIS LA DEFENCE 


ANSI American National Standards Institute 
1430 Broadway, New York, N.Y. 10018 


ASMO Arab Organization for Standardization and Metrology 
P.O. Box 926161, Amman, Jordan 


CAS China Association for Standards 
P.O. Box 820, Being, China 


CCITT Consultative Committee for International Telegraph and Telephone 
2, rue de Varembe, CH-1211 Geneva 20, Switzerland 


CSA Canadian Standards Association 
178 Rexdale Bvld, Rexdale, Ontario, M9W 1R3 


DIN Deutsches Institut fuer Normung 
Burggrafenstrasse 4-10, Postfach 1107, D-1000 Berlin 30 


EMCA European Computer Manufacturing Association 
114 Rue du Rhone, 1204 Geneva, Switzerland 


Appendix. National Language Standards and Laws A-1 


FIPS Federal Information Processing Standards J 
U.S. National Bureau of Standards, Institute for Computer Sciences and Technology, 
Gaithersburg, MD 20899 


IEC International Electotechnical Commission 
3, rue de Verambe, CH-1211 Geneva 20, Switzerland 


ISO International Standards Organization 
1, rue de Verambe, Case Postal 56, CH-1211 Geneva 20, Switzerland 


JIS Japanese Industrial Standards Committee 
c/o Standards Department, Agency of Industrial Science and Technology Ministry of 
International Trade and Industry, 1-3-1, Kasumigaseki Chiyoda-ku, Tokyo 100 


SCC Standards Council of Canada 
International Standardizations Branch, 2000 Argentina Road, Suite 2-401, 
Missassauga, Ontario, LSN 1V8 


Sample National Language Standards 


AFNOR NF 62-10 


ANSI BSR X3.134.1 


ANSI BSR X3.134.2 


ANSI X3.110-1983--CSA T3500 


ANSI X3.32 


ANSI X3.4 


ANSI X3.41 


ANSI X3.64 


ANSI X4.16 


CAS GB1988-80 


CAS GB2311-80 


CAS GB2312-80 


CCITT T.61 


French 7-Bit Code 


8-Bit ASCII Structure and Rules 


U 


8-Bit ASCII Supplemental Multilingual Graphic Character Set 


Videotex/Teletex Presentation Level Protocol Syntax - North American 
PLPS 


Graphic Representations of the Control Characters of ASCII 

American National Standard Code for Information Interchange 

Code Extension Techniques for Use with ASCII 

Additional Controls for Use with ASCII 

American National Standard Magnetic Stripe Encoding for Credit Cards 
7-Bit Coded Character Sets for Information Processing and Interchange 


Information Processing - 7-Bit Coded Character Set - Code Extension 
Techniques 


Code of Chinese Graphic Character Set for Information Interchange - 
Primary Set 


Character Repertoire and Coded Character Sets for the International Teletex 


Service } 
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CCITT T.100.1984 


CSA Z2243.4-1985 


DIN 66003 


FIPS PUB 15 


IEC - 417 


ISO DIS 4882 


ISO DIS-6329 


ISO-IR 


ISO R1090 


ISO R1093 


ISO 646 


ISO 2022 


ISO 2375 


ISO 3461 


ISO 4062 


ISO 4217 


ISO 4873 


ISO 6429 


ISO 6937 
ISO 7350 
ISO 8859 


JIS C 6226 


International Information Exchange for Interactive Videotex 


7-Bit and 8-Bit Coded Character Sets for Information Processing and 
Interchange 


Informationsverarbeitung 7-Bit Code (German 7 Bit Code) 


Federal Information Processing Standards - Subsets of the Standard Code 
for Information Interchange 


Graphic Symbols for Use on Equipment 


Office Machines and Data Processing Equipment, Line Spacings and 
Character Spacings 


Symbols for Duplicating and Document Copying Machines 


International Register of Coded Character Sets to be Used with Escape 
Sequences - Registration Authonty: ECMA, Geneva 


Functions Key Symbols for Typewriters 


Keytop and Printed or Displayed Symbols for Adding Machines and 
Calculating Machines 


Information Processing - 7-Bit Coded Character Set for Information 
Interchange 


Information Processing - ISO 7-Bit and 8-Bit Coded Character Sets - Code 
Extension Techniques 


Data Processing - Procedure for Registration of Escape Sequences 
Graphic Symbols, General Principles for Presentation 

Dictation Equipment Symbols 

Codes for the representation of currency and funds 


Information Processing - 8 Bit Code for Information Interchange - Structure 
and Rules for Implementation 


Information Processing - ISO 7-Bit and 8-Bit Coded Character Sets - 
Additional Control Functions for Character Imaging Devices 


Information Processing - Coded Character Sets for Text Communication 
Text Communication - Registration of Graphic Character Sub-repertoires 
8-Bit Single-Byte Coded Graphic Character Sets 


Code of the Japanese Graphic Character Set for Information Interchange 


Appendix. National Language Standards and Laws A-3 


National Laws 


Most countries have language laws affecting the importation, sale or use of data 
processing equipment, software and documentation. Some of the laws specify the 
language(s) to be used on labels, keyboards, documentation, and software. Other 
laws regulate cultural aspects such as date formats, calendars, and numeric 
representation. Since there are many laws and constant revision of laws, only the 
country issuing a law can adequately describe it. The following are a few of the 
language laws which have been issued (but may not be current): 


Canada: Quebec Law 101, Charter of the French Language, August 26, 
1977; Canadian Official Language Act of 1969. 


Sweden: The Work Environment Act, Ist July 1978. 


Venezuela: Consumer Protection Law, Article 10. 
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Glossary 


This glossary includes definitions of some terms found in 
this document. Some of the terms defined below are 
from The American National Dictionary for Information 
Processing, copyright 1977 by the Computer and 
Business Equipment Manufacturers Association, copies 
of which may be purchased from the American National 
Standards Institute a 1430 Broadway, New York, New 
York 10018. These definitions are identified by an 
asterisk. The symbol (ISO) at the beginning of a 
definition indicates that it is reproduced from a 
published section of the International Organization for 
Standardization Vocabulary of Data Processing or from 
a published section of the ISO Vocabulary of Office 
Machines. 


Arabic numerals. The numerals 1, 2, 3, 4, 5, 6, 7, 8, 9, 
and 0. See also Roman numerals. 


AZERTY keyboard. A keyboard in which the keys in 
the second-from-top row (row D) are labeled (from left 
to right): A, Z, E, R, T, Y, U, I, O, and P. See also 
QWERTY keyboard. 


Character Identifier. An identifier assigned to each 
graphic character. 


character set. A defined set of characters. No coded 
representation is assumed. 


code page. A specification of code points for each 
graphic character in a set or in a collection of graphic 
character sets. Within a code page, a code point can 
have one and only one specific meaning. 


*coded character set. (ISO) A set of unambiguous rules 
that establish a character set and the one-to-one 
relationships between the characters of the set and their 
coded representations. 


CSU. See customer setup. 


Customer setup. The unpacking, setup, and checkout of 
IBM CSU-designated machines by user personnel, 
according to a sequence of instructions provided by 
IBM, without the use of tools or the assistance of IBM 
personnel. 


data entry. The method of entering data into a 
computer system for processing, usually in a 
field-oriented environment where the entry is governed 
by a program. See text entry for comparison. 


DBCS. See double-byte character set. 


dead key. On a typewriter and in word processing, a 
key that allows a character to be typed without the 
imprint position being changed. This term should be 
replaced with nonescaping key. 


Diacritics. Modifying marks of a character, for 
example, the accent mark. 


double-byte character set. A character set in which i 
character is represented by a 2-byte code. Some 
character sets, such as Kanji, which is used in Japan, 
are too rich in symbols to be able to represent all the 
characters using 1-byte codes. A double-byte character 
set is used to represent the symbols that make up these 
larger character sets. 


EBCDIC. Extended Binary Coded Decimal 
Interchange Code. A coded character set consisting of 
8-bit coded characters. 


enable (national languages). To design a product to 
facilitate the inclusion of national language functions. 


folding. The substitution of one character for another, 
generally to map a larger character set into a subset. 
Folding may result in the loss of information. For 
example, folding allows the printing of uppercase 
graphic characters when lowercase characters are not 
available in the character array on the printer chain or 
train. 


*graphic character. (ISO) A character, other than a 
control character, that is normally represented by a 


graphic. 


graphic character set. A defined set of graphic 
characters. No coded representation is assumed. 
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ideographic language. A written language in which each 
character represents a thing or an idea. An example of 
such a language is Chinese. See also phonetic language. 


implement (national languages). To develop, integrate, 
test, and release specific national language functions on 
a product. 


ISO. International Organization for Standardization. 


*language. (ISO) A set of characters, conventions, and 
rules, that is used for conveying information. The three 
aspects of language are pragmatics, semantics, and 
syntax. 


Latin alphabet. An alphabet comprising the letters a, b, 
c, d, e, f, g, h, 1, j, k, 1, m,n, 0, p, q, r, Ss, t, U, V, W, X,Y, 
and z in uppercase lowercase, with or without accents. 


Latin alphabet number 1. Latin alphabet number 1| is 
set of 190 characters as described by ISO 8859/1. 


lowercase. The small letters a, b, c, d, e, f, g, h, i, j, k, 1, 
m,n, 0, p, q,r, Ss, t, u, V, W, X, y, and z, and other 
characters in the lower shift. See also uppercase. 


machine readable information (MRI). All textual 
information contained in a program such as a system 
control program, an application program, or microcode. 
MRI includes all information that is presented to or 
received from a user interacting with a system. This 
includes menus, prompts, messages, report headings, 
commands, and responses. MRI may appear on 
printers or on display panels. Contrast with machine 
readable material (MRM), which includes MRI, 
executable code, and the constants. 


MLP. See multilingual code page. 

monocasing. The translation of alphabetic characters 
from one case (usually the lowercase) to their 
equivalents in another case (usually the uppercase). 


MRI. See machine readable information. 


multilingual code page (MLP). A code page supporting 
more than one language. 


national language support. The ability for a user to 
communicate with products in a language other than US 
English. 


national use graphics. Graphics on a code page not 
forming part of the invariant character set. 


NL function. National language function. Those 
functions permitting a user to interact with products 
using a national language. They include character sets, 
code pages, all language, logistics, and cultural 
functions, and translatability considerations. 


NLS. See national language support. 


nonescaping key. On a typewriter and in word 
processing, a key that allows a character to be typed 
without the imprint position being changed. 


phonetic language. A written language in which each 
character represents a sound. Examples of phonetic 
languages are English, Greek, and Russian. See also 
ideographic language. 


product. A hardware or software unit that is 
individually packaged, that is marketable, and to which 
a type number/program number has been assigned. 


QWERTY keyboard. Keyboard in which the keys in 
the second-from-top row (row D) are labeled (from left 
to right): Q, W, E, R, T, Y, U, I, O, and P. See also 
AZERTY keyboard. 


retrofit (national languages). To implement national 
language function on an unenabled product. 


Roman numerals. The numerals I, V, X, L, C, D, and 
M. See also Arabic numerals. 


SBCS. See single-byte character set. 


single-byte character set. A character set in which each 
character is represented by a |-byte code. 


special character. A graphic character that is not a 
letter, not a digit, and not a space character. 


system. A collection of interconnected hardware and 
software organized to accomplish a set of specific 
functions. 


text entry. Entry of characters in a free format. See 
data entry for comparison. 


uppercase. The capital letters A, B, C, D, E, F, G, H, 
I,J, K, L, M, N, O, P, Q, R, 8, T, U, V, W, X, Y, and 
Z, and other characters in the upper shift. See also 
lowercase. 


user. A person who requires the services of a 
computing system. 
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description 2-13 
keyboard information 3-91 
language 
description 2-13 
name 2-13 
percentage of population 2-14 
number format 3-99 
sort sequence 
considerations 3-50 
time format 3-105 
unique characteristics of the language 2-13 
weights and measurements 3-107 
New Zealand 
alphabet characteristics 2-14 
character classification 
considerations 3-52 
character table 3-52 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-14 
keyboard information 3-9] 
language 
description 2-14 
name 2-14 
percentage of population 2-14 
number format 3-99 
sort sequence 
considerations 3-52 
time format 3-105 
unique characteristics of the language 2-14 
weights and measurements 3-107 
NL function, definition X-2 
NLS, definition X-2 
nonescaping key 
definition X-2 
usage 2-4, 2-20 
Norway 
alphabet characteristics 2-15 
character classification 
considerations 3-54 
character table 3-54 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-15 
keyboard information 3-91 
language 
description 2-15 
name 2-15 
percentage of population 2-15 
number format 3-99 
sort sequence 
considerations 3-54 
time format 3-105 
weights and measurements 3-107 
number 
notation table 3-99, 3-100 
rounding 3-98 


numeric pad 
format 3-90 
table 3-91, 3-92 


P| 


padding character 
table 3-96, 3-97 
percent symbol 3-10] 
phonetic language, definition X-2 
phonetic sequence 
sort 
PRC 3-56 
ROC 3-62 
Poland 
alphabet characteristics 2-16 
character classification 
considerations 3-57 
character table 3-57 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-16 
keyboard information 3-9] 
language 
description 2-16 
name 2-16 
percentage of population 2-16 
number format 3-99 
sort sequence 
considerations 3-57 
time format 3-105 
weights and measurements 3-107 
Portugal 
alphabet characteristics 2-17 
character classification 
considerations 3-59 
character table 3-59 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-16 
keyboard information 3-91 
language 
description 2-17 
name 2-16 
percentage of population 2-17 
number format 3-99 
sort equal weight characters 3-59 
sort sequence 
considerations 3-59 
time format 3-105 
unique characteristics of the language 2-17 
weights and measurements 3-107 
PRC 2-15 
See also PRC 


Index X-1] 


alphabet characteristics 2-16 
character set 
considerations 3-56 
currency 3-95 
currency separator symbols 3-97 
date format 3-104 
DBCS 3-56 
description 2-15 
keyboard information 3-92 
language 
description 2-15 
name 2-15 
percentage of population 2-16 
monocasing considerations 3-56 
number format 3-100 
SBCS 3-56 
sort 
phonetic sequence 3-56 
radical within stroke 3-56 
stroke within radical 3-56 
sort sequence 
considerations 3-56 
time format 3-106 
weights and measurements 3-108 
product, definition X-2 


[9] 


QWERTY keyboard 
definition X-2 
usage 2-5 


| 


radical within stroke 
sort 
PRC 3-56 
ROC 3-62 
retrofit, definition X-2 
ROC 
alphabet characteristics 2-18 
calendar 3-101 
character set 
considerations 3-62 
currency 3-95 
currency separator symbols 3-97 
date format 3-104 
DBCS 3-62 
description 2-17 
keyboard information 3-92 
language 
name 2-17 
percentage of population 2-18 


monocasing considerations 3-62 
number format 3-100 
SBCS 3-62 
sort 
phonetic sequence 3-62 
radical within stroke 3-62 
stroke within radical 3-62 
sort sequence 
considerations 3-62 
time format 3-106 
weights and measurements 3-108 
Roman numerals 
definition X-2 
Korea usage of 3-63 
Romania 
alphabet characteristics 2-19 
character classification 
considerations 3-64 
character table 3-64 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-19 
keyboard information 3-91 
language 
description 2-19 
name 2-19 
percentage of population 2-19 
number format 3-99 
sort sequence 
considerations 3-64 
time format 3-105 
weights and measurements 3-107 


| 


SBCS 

See single-byte character set 
shift lock 

description 3-90 

table 3-91, 3-92 
single-byte character set 

definition X-2 


Japan 3-48 

Korea 3-63 

PRC 3-56 

ROC 3-62 
sort 

general 3-2 

Japan 


dictionary 3-48 

strokes and radicals 3-48 

telephone directory 3-48 
PRC 

phonetic sequence 3-56 
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radical within stroke 
stroke within radical 
ROC 


3-56 
3-56 


phonetic sequence 3-62 


radical within stroke 
stroke within radical 
sort equal weight characters 
Argentina 3-7 
Austria 3-10 
Belgium 3-13 
Canada 3-17 
Czechoslovakia 3-20 
France 3-29 
FRG 3-34 
GDR_ 3-32 
Hungary 3-4] 
Italy 3-46 
Portugal 3-59 
South Africa 3-66 
Spain 3-70 
Turkey 3-82 
sort exits 3-3 
sort sequence 3-1 
considerations 3-2 
Albania 3-5 
Argentina 3-7 
Austria 3-10 
Belgium 3-13 
Canada 3-17 
Denmark 3-22 
Finland 3-25 
France 3-28 
FRG 3-34 
GDR_ 3-32 
Greece 3-37 
Hungary 3-4] 
Iceland 3-43 


Italy 3-46 
Japan 3-48 
Korea 3-63 


Netherlands 3-50 
New Zealand 3-52 
Norway 3-54 
Poland 3-57 
Portugal 3-59 
PRC 3-56 
ROC 3-62 
Romania 3-64 
South Africa 3-66 
Spain 3-69 
Sweden 3-73 
Switzerland 3-76 
Thailand 3-79 
Turkey 3-82 
UK 3-84 
USA 3-86 
Latin alphabet number | 
MLP_ 3-3 
FRG 3-34 


3-62 
3-62 


3-2 


Switzerland 3-76 
South Africa 
alphabet characteristics 2-20 
character classification 
considerations 3-66 
character table 3-66 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-19 
keyboard information 3-91 
language 
description 2-20 
name 2-20 
percentage of population 2-20 
number format 3-99 
sort equal weight characters 3-66 
sort sequence 
considerations 3-66 
support of more than one language 2-20 
time format 3-106 
unique characteristics of the language 2-20 
weights and measurements 3-107 
Spain 
alphabet characteristics 2-21 
character classification 
considerations 3-69 
character table 3-70 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-20 
keyboard information 3-9] 
language 
description 2-21 
name 2-21 
percentage of population 2-2] 
number format 3-99 
sort equal weight characters 3-70 
sort sequence 
considerations 3-69 
support of more than one language 2-2] 
time format 3-106 
unique characteristics of the language 2-2] 
weights and measurements 3-107 
Spanish 


language 
Argentina 2-2 
Spain 2-2] 


special character 

definition X-2 

sort 3-2 
standards 

language-related A-2 
standards organizations' addresses A-1 
stroke within radical 

sort 

PRC 3-56 


Index 


X-13 


ROC 3-62 


strokes and radicals 


sort 
Japan 3-48 


Sweden 


alphabet characteristics 2-22 
character classification 
considerations 3-73 
character table 3-73 
currency 3-94 
currency separator symbols 3-96 
date format 3-102 
description 2-22 
keyboard information 3-91 
language 
description 2-22 
name 2-22 
percentage of population 2-22 
number format 3-99 
sort sequence 
considerations 3-73 
time format 3-106 
weights and measurements 3-107 


Switzerland 3-76 


alphabet characteristics 2-23 
character classification 
considerations 3-75 
character table 3-76 
currency 3-94 
currency separator symbols 3-97 
date format 3-102 
description 2-22 
keyboard information 3-91 
language 
description 2-23 
name 2-22 
percentage of population 2-23 
monocasing 
considerations 3-75 
number format 3-99 
sort sequence 
considerations 3-76 
time format 3-106 
unique characteristics of the language 2-23 
weights and measurements 3-107 


system, definition X-2 


character 

description of heading 3-4 
currency field length 3-96, 3-97 
currency format 3-94, 3-95 
currency separator symbols 3-96, 3-97 


table 


currency symbol 3-94, 3-95 
date format 3-102, 3-104 
date separator symbols 3-102, 3-104 
keyboard information 3-91, 3-92 
number notation 3-99, 3-100 
padding character 3-96, 3-97 
time format 3-105, 3-106 
time separator symbols 3-105, 3-106 
weights and measurements 3-107, 3-108 
technical characteristics 3-1 
telephone directory 
sort 
Japan 3-48 
terminology used in Volume 2 1-1 
text entry, definition X-2 
Thailand 
alphabet characteristics 2-24 
calendar 3-101 
character classification 
considerations 3-79 
character table 3-79 
currency 3-94 
currency separator symbols 3-97 
date format 3-102 
description 2-23 
keyboard information 3-91 
language 
description 2-23 
name 2-23 
percentage of population 2-24 
number format 3-99 
sort sequence 
considerations 3-79 
time format 3-106 
weights and measurements 3-107 
time format 
table 3-105, 3-106 
Turkey 
alphabet characteristics 2-24 
character classification 
considerations 3-82 
character table 3-82 
currency 3-94 
currency separator symbols 3-97 
date format 3-102 
description 2-24 
keyboard information 3-91 
language 
description 2-24 
name 2-24 
percentage of population 2-24 
number format 3-99 
sort equal weight characters 3-82 
sort sequence 
considerations 3-82 
time format 3-106 
weights and measurements 3-107 
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UK 


alphabet characteristics 2-25 
character classification 

considerations 3-84 
character table 3-84 
currency 3-94 
currency separator symbols 3-97 
date format 3-103 
description 2-25 
keyboard information 3-91 
language 

description 2-25 

name 2-25 


percentage of population 2-25 


number format 3-99 
sort sequence 

considerations 3-84 
time format 3-106 


unique characteristics of the language 2-25 


weights and measurements 3-107 


uppercase 


caps lock information 3-90 
definition X-2 


USA 


alphabet characteristics 2-26 

character classification 
considerations 3-86 

character table 3-86 

currency 3-95 

currency separator symbols 3-97 

date format 3-103 

description 2-25 

keyboard information 3-91 

language 


description 2-25 
name 2-25 


percentage of population 2-26 


number format 3-99 

sort sequence 
considerations 3-86 

time format 3-106 


unique characteristics of the language 2-26 


weights and measurements 3-107 


user, definition X-2 


weights 3-107 


Yugoslavia 


alphabet characteristics 2-26 
character table 3-88 
currency 3-95 
currency separator symbols 3-97 
date format 3-103 
description 2-26 
keyboard information 3-92 
language 

description 2-26 

name 2-26 


percentage of population 2-26 


number format 3-100 
time format 3-106 
weights and measurements 3-107 


Index 


X-15 
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